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PREFACE 


The rapidly increasing number of students beginning graduate 
work are handicapped by the lack of books in English which 
provide readable introductions to important parts of mathematics. 
Nor is the diflBiculty met adequately by the slow method of lecture 
courses. 

Purpose of This book is based on the author^s lectures of recent 
this book years. It presupposes calculus and elementary theory 
of algebraic equations. Its aim is to provide a simple introduction 
to the essentials of each of the branches of modern algebra, 
with the exception, of the advanced part treated in the author^s 
Algebras and Their Arithmetics. The book develops the theories 
which center around matrices, invariants, and groups, which 
are among the most important concepts in mathematics. 

It is a te3rt for The book provides' adequate introductory courses 
several courses (i) higher algebra, (ii) the Galois theory of 
algebraic equations, (ui) finite linear groups, including Klein’s 
“icosahedron” and theory of equations of the fifth degree, and 
(iv) algebraic invariants. 

Higher The subject known in America as higher algebra is 
algebra treated fully in Chapters III~VI; it includes matrices, 
linear transformations, elementary divisors and invariant factors, 
and quadratic, bilinear, and Hermitian forms, whether taken singly 
or in pairs. While the results are classic, the presentauion is new 
and particularly elementary. Due attention is given to ques¬ 
tions of rationality, which are too often ignored. The unified 
treatment of Hermitian and quadratic forms requires but little 
more space than would be needed for quadratic forms alone. 
Elementary divisors and invariant factors are introduced in 
Chapter V in a simple, natural way in connection with the classic 
and a new rational canonical form of linear transformations; this 
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treatment is not only more elementary than the usual one, but 
develops these topics in close connection with their most frequent 
applications. It is then a simple matter to deduce in Chapter VI 
the theory of the equivalence of pairs of bilinear, quadratic, or 
Hermitian forms. We thereby avoid the extraneous topic of 
matrices whose elements are polynomials in a variable and the 
''elementary transformations^' of them. 

Algebraic Is every equation solvable by radicals? This question 
equations jg Qf absorbing interest in the history of mathe¬ 
matics. It was finally answered in the negative by means of 
groups of substitutions or permutations of letters. The usual 
presentation of group theory makes the subject quite abstruse. 
This impression is avoided in the exposition in Chapters VII-XI. 
Substitutions are introduced in a very deliberate and natural 
way in connection with the solution of cubic and quartic equa¬ 
tions. The reader will therefore appreciate from the start some 
of the reasons why substitutions and groups are employed. 
Portimately we are able to alternate theory and application in the 
further exposition of groups. The theory gives very simple 
answers to the following questions: Can every angle be trisected 
with ruler and compasses? What regular polygons can be con¬ 
structed by elementary geometry? 

Icosahedron, Klein's book on the icosahedron and equations of 
linear groups degree is a classic, but causes real diffi¬ 

culties to beginners on account of the inclusion of ideas from 
many branches of higher mathematics. Chapter XIII gives a 
simple exposition of the essentials of this interesting theory, 
which is a prerequisite to the subjects of elliptic modular func¬ 
tions and automorphic functions. The preliminary Chapter XII 
discusses the removal of several terms from any equation by 
means of a rational (Tschimhaus) transformation, and the reduc¬ 
tion of the general equation of the fifth degree to Brioschi's normal 
form, which is well adapted to solution by elliptic functions. The 
final Chapter XIV is a sequel to Klein's theory. It establishes 
remarkable results on the representation of a given group as a 
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linear group, and gives an introduction to Frobenius^s theory of 
group characters. The latter is an effective tool for finite groups 
and has led to important new results. 

Algebraic Chapters I and II provide an easy introduction to the 
invariants important subject of invariants. Hessians and Jaco- 
bians are shown to be covariants and applied to the determina¬ 
tion of canonical forms of binary cubic and quartic forms, as 
well as to the solution of cubic and quartic equations. Every 
seminvariant is proved to be the leading coefficient of one and 
only one covariant. It is shown that all covariants of any 
system of binary forms are expressible m terms of a finite number 
of the covariants. Such a fundamental system of covariants is 
actually found for one form of each of the orders 1, 2, 3, 4. Valu¬ 
able supplementary work on invariants is provided by Chapter 
XIII, which presupposes the concept of groups of substitutions 
explained in the elementary Chapter VII. 

There are numerous sets of simple problems, and a few historical 
notes. On pages 38,133,176,203, and 249 there are lists of topics 
for further reading, with references to writings in English. These 
topics are suitable for assignment to students for full reports at the 
end of the particular course. 

L. E. Dickson 

University of Chicago 
March 16, 1926 
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MODERN ALGEBRAIC THEORIES 
Chapter I 

INTRODUCTION TO ALGEBRAIC INVARIANTS 

Invariants and covariants play an important r61e in the various 
parts of modem algebra as well as in geometry. The elementary 
theory presented in this chapter will meet the ordinary needs in 
other parts of mathematics. It covers rather fuUy the subject of 
invariants and covariants of a homogeneous polynomial in x and 
y of degree < 6, with application to the solution of cubic and 
quartic equations. When the degree exceeds 4, most of the cova¬ 
riants are too long to be of real use, unless their symbolic repre¬ 
sentation is employed. 


1. Linear transformations. When two pairs of variables a:, y 
and i; are coimected by relations of the form 


x = + 11 ), y = e^ + di), 


D = 


a i 
c d 


^ 0 , 


these relations define a linear transformation T of determinant D. 
Consider another linear transformation U defined by 


| = eZH-/7, v = gX + hY, 


A = 


e f 
g h 


5 ^ 0 . 


The equations obtained by eliminating | and are of the form 
X = kX + lY, y — mX + nY, 


and define a linear transformation P which is called the product 
of T and U, taken in that order, and is denoted by TU. The 
values of the coefiSicients are 


1 
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fc ~ 06 + I — af + bhf m ^ ce + dg^ n — cf + dh. 

The determinant of P is found to be equal to PA, and hence is 
not zero. If we solve the equations of T and in the result replace 
xhy X and y by Y, we get 

ri - -P-icZ + P-ioF. 

These equations define a transformation called the inverse of T 
and denoted by Since the variables of are the same as 
those of Uj the product is found by eliminating { and 97 
and hence is rc = X, y — Y, The latter is called the identity 
transformation. It is readily verified that also T is the identity 
transformation. 

We shall next prove the associative law 
r?7. 7 = P • UV, 

which allows us to write TUV for either product. Let 
X — pu + qVf Y — ru + sv 

be the equations of 7. The product TU * 7 is found by elim¬ 
inating first f, 7} and then X, Y between the equations of P, ?7, 7, 
while the product P • UV is obtained by eliminating first X, Y 
and then tj between those equations. In each case we must 
evidently obtain the same equations expressing x and y as linear 
functions of u and v, 

2 . Forms and their classification* A polynomial like 
cKT® + 6 a? y + cxyz + dxz\ 

every term of which is of the same total order (here 3) in the vari¬ 
ables a;, 2 /, Zj is called homogeneous in x, y, z. A homogeneous 
polynomial is called a form. According as the number of variables 



§3] 


HESSIANS 


3 


is 1, 2, 3, 4,..., or q, the form is called unary, binary, ternary, 
quaternary ,..., or q-ary, respectively. According as the order of 
the form is 1, 2, 3, 4,, p, it is called linear, quadratic, cubic, 
quarUc,..., p4c, respectively. 

For example, the polynomial displayed above is a ternary cubic 
form, while a®* + bxy + cp® is a binary quadratic form. 


3. Hessians. The Hessian (named after Otto Hesse) of a func¬ 
tion/(*, p) of two variables is the determinant 



a*/. 

aa? 

dxdy 

av 


dydx 

ay* 


whose elements are second partial derivatives. For example, the 
Hessian of f = ai? + 2bxy -f- cj/® is 4 (ac — 6 *). 

The Hessian A of /(a:i,..., a:,) is the determinant of order q in 
which the elements of the ith row are 


( 2 ) 


av av 

-j - f • • • 7 -* 

dXi dxi dXi dX2 dXi dXff 


Let / become ^"( 2 / 1 ,.. -, 2 / 5 ) under the linear transformation 
(3) Xi — Ciij/i + ... + yq (i — 1,..., q), 

whose determinant is 



Cql * • • Cqq 


The product hA is a determinant of order q whose element in 
the ith row and jth column is the sum of the products of the 
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elements (2) of the ith row of h by the corresponding elements 
Cl,-, caCjj of the jtb column of A and hence is equal to the 
partial derivative with respect to of 


( 4 ) 


df df df 


df dXl ^ 

dxi dyj dXg dyi dy^ 


Let A' denote the determinant obtained from A by interchang¬ 
ing its rows and columns, whence A' = A. In the product A' • A A, 
the element in the rth row and jth column is therefore 

dxi dyj dXq dyj dy, dy] 


since Cir is the partial derivative of in (3) with respect to 
Hence 


A*A = 




dyr dyj 






Hessian of F. 


Thbobbm 1 . If F is the function obtained from f by applying 
any linear transformation of determinant A, the Hessian of F is 
equal to the product of the Hessian of f by A^. 


4. Befnition of invariants and covariants. Let a transforma¬ 
tion T of determinant A, expressing x and y linearly in terms of 
I and 1 ftj replace / = aa;® + 2hxy + cy^ hj + 25|i7 + Ci?®. We 
saw in §3 that the Hessian of / is the product of the discriminant 
ac — 6^ of / by 4. By Theorem 1, 

AC - = A^iac - 62 ). 


We therefore call the discriminant ac — 6* an invariant of index 2 
of /. We shall generalize this definition. 
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Consider the general binary form of order p, 

f = aox^ + ai y + - 

Let a transformation T replace f by 

F = Ao + -d. 1 97 4* • • • + Ap 97 ^. 

A polynomial I (ao,.. ., is called an invariant of index I of / 
if, for every transformation T of determinant A 9 ^ 0, 

I (Ao, • • •) Ap) = I (aof,», j ap)j 

identically in ao,..., Op, after the A's have been replaced by their 
values in terms of the a^s. 

Covariants K are defined similarly. If, for every transformation 
T of determinant A 5 *^ 0, a polynomial K in the coeflB.cients and 
variables of f has the property that 

jK:(Ao, ..., Ap; f, 7 ?) s A^iTCao,..., ap,* x, y), 

identically in ao, ... , ap, g, 97, after the A's have been replaced 
by their values in terms of the a's, and after x and y have been 
replaced by their values in terms of f and 97 from T, then K is 
called a covariant of index I of /. 

By Theorem 1, the Hessian of / is a covariant of index 2 of /. 
Note that / itself is a covariant of index zero of / since F = A°/- 
By a covariant of index Z of a system of forms /i,... is 
meant a function of their coeflSlcients and variables rci,..., ojg 
whose product by A ^ is equal to the same function of the corre¬ 
sponding coeflicients and variables 2 / 1 ,..., 2/3 in the forms 
Fi, ...,Ffc derived from /i, by applying the general 

linear transformation (3) whose determinant A is not zero. An 
example is given by Theorem 2 below. 

A covariant which does not involve any of the variables is an 
invariant. 
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S. Jacobians. The functional determinant or Jacobian (named 
after C. G. J. Jacobi) of q functions ..., ««),... 

..., Xq) with respect to the variables Xi,... ,Xg is defined to be 
th^ determinant 



Bfi 

3fi 

dxi 

dXz 

dXq 



a 

dXi 

dXq 



Theorem 2. The Jacobian of fi,... ,fq is a covariarU qf index 
unity of the system of forms fi,..., fq. 

Under a transformation (3) of determinant A, let /< become 
^iiVu 1 2/e)- Then (4) holds if we put the subscript i on each 
f and F. Hence the Jacobian of Fi,... ,Fq with respect to the 
variables j/i,..., j/* is equal to 


t 

4- /• 


..+-^c 

— Cu + • 
axi 

, V,. 




This determinant is the product of the determinant (5) by the 
determinant A = | |. 


Exercises 

1. lifi,... ,fqsee dependent functions, their Jacobian is identically zero. 
[Use/a = j(/i,... ,/j.j).] 

2. Two fonctioiis / and g of x and y are dependent if their Jacobian J ia 
identically zero. 

[Unless p is a constant function of /, we may take dg/dx not identically 
zero and hence employ g and y as new variables in place of x and y. Then 
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/ = F (ff, j/), and J reduces to — ^ ~ ] 

dx dy 

3. If/(as, y) is a binary form of order p, then (Euler) 


a/ , a/ . 

as— +j/— =p/. 
dx dy 


4. The Hessian of {ax + hy)v is identically zero. 
[It is sufficient to prove this for x^. Why?] 


6. Conversely, if the Hessian of a binary form f{x,y) of order p is identically 
zero, / is the pth power of a linear function. 

[The Hessian of / is the Jacobian of df/dXj df/dy. The latter are depend¬ 
ent by Ex. 2, Thus 


dx 



^ 0 , 


where a and h are constants. Solve this with Euler^s relation in Ex. 3, find the 
derivatives of log /, and integrate.] 

6. If the Jacobian of a quadratic form / and a linear form lia x and y is 
identically zero, / is the product of Z® by a constant. 

7. If a form fmq variables is expressible as a form in fewer than q variables, 
the Hessian of/is identically zero.i 

8. The Jacobian of any binary form / and its Hessian is a covariant of 
index 3 of/. 

9. / = ate® + 2bxy -h c?/® and p = pa?® -f 2qxy + rp® have the invariant 
s = ar — 2bq 4* cp of index 2. 

[The discriminant of / + is an invariant for every constant t.] 

10. A binary quadratic form / and a linear form I have a linear covariant 
J, their Jacobian. Show that the Jacobian of J and I is an invariant of index 2 
of the system / and Z. 

11. For /, Z, J in Ex. 10, prove that the points represented by / = 0 are 
separated harmonically by the points represented by Z « 0 and / = 0. 


6 . The discriminant of a binary cubic form. Write 

(6) /(a;, y) = arc® + Zhy? y + Zexy^ + dy^. 

^ The converse is true when q is less than 6, but not when q exceeds 4 (as believed 
erroneously by Hesse). See Gordan and N5ther, Math. Annalen, 10,1876, 664. 
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Its Hessian is 36A, where 

h = m? 2sxy r = ac — 2s = ad — he, 

t^bd- cK 

Under any linear transformation of determinant A, let / become 

(8) F = + 3Be n + + DyK 

Denote the Hessian of F by 36H. Since ft is a covariant of index 2 
of/, 

H = R^ + 2S^v + r*?* = A2 ft, 

R ^ AC - 2S = AD ~ BC, T = BD - CK 

Hence A* r, 2A* s. A® t are the coefficients of a binary quadratic 
form which our transformation replaces by one having the coeffi¬ 
cients iS, 2S, T. Since the discriminant of a binary quadratic form 
is an invariant of index 2, we have 

RT - AH - (A® s)2} = A« (rt - s*). 


Hence^ r< — s® is an invariant of index 6 of /. We shall call 
(10) - 4(ri - s“) = (ad- bey -4(ae-b^) (bd - c?) 

the discriminant of the binary cubic form (6). 

To justify this definition, we shall prove 

Theobem 3. The invariant (10) is zero if and only if f(x/y, 1) 
= 0 has a multiple root, i.e., iff (x, y) is divisible by the square of a 
linear function of x and y. 

For, if the latter be the case, we can transform / into a form (8) 
having the factor then C — D = 0 and the function (10) written 
in capital letters is zero, so that the invariant (10) itself is zero. 

^ This result and those of Exs. 8 and 10 of §5 illustrate the theorem that any 
invariant (or covariant) of one or more covariants of a system of forms is an in¬ 
variant (or covariant) of the forms. 
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Conversely, let (10) be zero, so that the discriminant of (7) is 
zero. Then A is the square of a linear function Apply a linear 
transformation which expresses x and y in terms of J and any 
function if independent of it. We now have (9) with h = 
Hence /S = 0 , T = 0. H D = 0 , then (7 = 0 (by T = 0 ) and ( 8 ) 
has the factor f®. If D 5 ^ 0, we have 

P 

5 = ^' A = -. F = + . 

In either case, / = jF is divisible by a square of a linear function. 

7. Canonical form of a binary cubic form; solution of cubic 
equations. 

Thbobem 4. Any binary cubic form can be transformed into 
if its discriminant is not zero, and into X^Y or if its 
discriminant is zero. 

Since the last statement has been proved, let the discriminant 
(10) of the form ( 6 ) be not zero. Then, if we discard the factor 
36 from all Hessians, the Hessian (7) is the product of two Imearly 
independent factors J, ij. Hence / can be transformed into a form 
F of type ( 8 ) whose Hessian (9) reduces to 2S^n, whence B = 0, 
T = 0,S?£0. H (7 = 0 , then H = 0 (by 12 = 0 ) and F = 

+ Drf, where AD 0 (by B 5 ^ 0). Taking 

f = A-i X, y = D-i F, 

we get F = Z® + F®, as desired. The remaining case (7 0 is 

readily excluded; for, then H 0 (by T = 0) and 

£2 C® 

A = -> D = ~, AD = BC, S =-- 0. 

C B ’ 

Hence to solve a cubic equation c(w) =0 without a multiple 
root, reduce the cubic form f{x,y) s j/® c {x/y) to the form 
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+ Dt?® by employing the factors 5 and ri of the Hessian of /. 


Exebcises 

1. The Hessian of / = + 6a;® 2 / + is 36 (d — 8) [xy + 2^*)* 

'When its factors { x -j- 2y and v - y are taken as new variables, / becomes 
? + W - ShK 

2. Tor d =* 9 in Ex. 1, x/y *= — 3, — 2 — w, or — 2 — where « is an 
imaginary cube root of unity. 

3. What are the roots when d » 7? 

4. The discriminant of x® — 27x2/® + ^ zero. 

3. Homogeneity of covariants of a system of forms. 

Theorem 5, A covariant which is not homogeneous in the vari- 
ables is a sum of covariants each homogeneous in the variables. 

Let the system of forms have the coefficients a, 6 ,, arranged 
in any order. If is a covariant of the forms, 

K{Ai S,. -.; f, 17,..= A^Kia^ 6,... ;aj, ...). 

■When a?, • are replaced by their linear expressions in 

17 ,, the terms of order ni on the right (and only terms of order 
ni) give rise to terms of order ni in ^ 7 ,... on the left. Hence, if 
Ki is the sum of aU the terms of order ni of we have 

^liAf R, • • • i Vi • • •} “ (o, hf • * * ] XfPf t * •), 

so that £1 is a covariant of the system of forms. Similarly, the 
sum Ki of all the terms of order ns of Z is a covariant. Hence K 
is a sum of covariants each homogeneous in the variables. 

Theorem 6. When a covariarU of two or more forms is not 
homogeneous in the coefficients of each fom separately^ it is a sum of 
covariards each homogeneous in the coefficients of each form sepor 
rately. 
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The proof is entirely similar to that used in the following case. 
Employ the notations of Ex. 9 of §5 and write d = ac — h^. 
Then s + 3d is an invariant of index 2 of the forms/and g. Let a 
linear transformation of determinant A replace / and g by 

F Ae + 2B^r, + Cv\ 6 ^ P^ + 2Q^ + Bn\ 

so that A, B, C are linear fimctions of a, b, c, while P, Q, B are 
linear functions of p, q, r. By hypothesis, 

AB - 2BQ + CP+ 2(AC - B^) = A*(s + 3d). 

The terms 3dA^ of maximum degree 2 in a, &, c on the right arise 
only from the part 3 (AC — B^) on the left. Hence d is itself an 
invariant of index 2; likewise s is itseH an invariant. 

In view of Theorems 5 and 6, we may and shall henceforth 
restrict attention to covariants which are homogeneous in the 
variables (of constant order n) and homogeneous in the coefficients 
of each form separately. 

Thbobem 7. Ld Khe a covariant of order n and index I of the 
forms fi,... tfrOf orders pi,..., printhe same variables xi,... ,Xq. 
Let K he of degree di in the coefficients of fi. Then YjVi ~ ^ 
= ffZ. 

Let/< have the coefficients Cij(j = 1, 2,...). Apply the trans¬ 
formation 

xx = myi,... ,X 3 = myg 

of determinant m«. Then/i becomes a form having the coefficients 
Cii = dj. Thus 

K(mri cij,..., m^r c,,-; mr^ Xi,..., m-^ x^) 

= (w®)* K(Cij ,..., Crj', Xxf.,,, ajj), 
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identically in m, xi,..., cij,,.., Crj, where ea,ch 3 has its 

range of values. We may remove the factor m“* and place the 
factor m-" in front of the left member. Since K is of degree di in 
thecj^d^ “ • • •)> 

which proves the theorem. 

If, when r = 1, we do not assume that is of constant degree 
di, but consider one term of degree di, we see that the proof gives 
Pidi — n = gl. Hence di is the same for all terms. This proves 

Theosem 8. A c(wanant of constant order n of a single form f is 
of constant degree in the coefficients of f. 

9. Weights of the coefficients of a covariant of a binary form. 
It is now necessary to employ the subscript notation for the coeffi¬ 
cients of the form 


. . / = no SB’’ •+• poi tcp-i y+ 

^ ^ Oai y* + - 

the binomial coefficients p, etc., being prefixed to avoid fractions 
in the later work. The weight of oj, is defined to be h and that of a 
product of factors Oi to be the sum of the weights of the factors. 

To make (6) conform with our present notation (11) for p = 3, 
replace a, 6, c, d by Uo, oi, 02 , os- Then the Hessian (7) becomes 


r = Oo flu ~ Oi*> 


36 (ra^ + 2 sxy + ty^), 
2 s = fflo fls ~ fli fluj 


t — an 


The terms of r are of constant weight 2, those of s of weight 3, and 
those of t of weight 4. Since the Hessian is of index 2, this illus¬ 
trates 
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Theorem 9. In a covariant of order n and index I, 

(13) E: =2^r(a)a?•“’■y^ 

r«0 

of a single form (11), dH terms of the coefficient Sr(a>) are of the 
same vmght l + r. In particular, the weight of the leader Saifl) of 
the covariant is egyal to the index 1. 

The transformation x = y = mi) replaces / by a form whose 
literal coefficients are Ai = m^ ai. By the definition of a covariant 
K of index I, we have 

(14) Z'Sr (A )!»■’■ V' = Sr{a)xf'-'-y\ 

r-0 

Let t = cao®o ai®i • • • ap« 2 > be any term of 5^r(a); the weight of t 
is tOr = + 2^2 + • • • + 'pep. The left member of (14) has the 

term 

cAo^o^i*! • • • Ap^p 7)^ - 

which must be equal to a term of the right member of (14). 
Hence Wr ^ r — L This proves Theorem 9. 

Each >Sr(<j) is isobaric, being of constant weight. If we attribute 
the weight 1 to a? and the weight 0 to y, we see that every term of 
(13) is of total weight I + n. 

CoEOLLAnY. A homogeneous polynomial is a covariant of f with 
respect to every transformation x — y — mri, if and only if it is 
isoharic. 


Exeecises 

1. Verify Theorem 9 for the covariant / of /. 

2. For an invariant of index I and degree d of one binary form of order p, 
pd = 21. Here I is also the weight of the invariant. 

3. Hence no binary form of odd order has an invariant of odd degree. 
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10 . SA minT flriants. A homogeneous, isobaric polynomial in 
the coefficients of a binary form/is called a sminvariant of/if it is 
an invariant of / with respect to all transformations of the type 

Tki x== ^ + 2/ = i;. 

Let Ao, Ai,... be.the literal coefficients of the form obtained 
from (11) by applying transformation Tk- Although not essential 
to our discussion, note that 

(15) Ao == Oo, Ai = oi + kao, As = Os + 2kai + oo. 

The first equation shows that Oo is a seminvariant of /. Eliminar 
tion of k between the last two equations gives 


Ao Ai ” Ai^ “ OtQ flo 


whence Oo og — oi* is a seminvariant of /. It is the leader of the 
Hessian of / for p = 3, as noted before Theorem 9. Evidently ao 
is the leader of the covariant / of /. These two cases illustrate 

Thbobem 10. The leader of every covariant K of any Unary 
form f is a seminvariant of f. 

The leader So (a) of any covariant (13) is isobaric by Theorem 9 
and is homogeneous by Theorem 8. Since E is a covariant of / 
with respect to every transformation Tk of determinant unity, 
we have 

K =25,(A)f‘-^i?'' =2/Sr(A)(a; - ky^-’-y^. 


A comparison of the coeflScients of iff* with that in (13) gives So (a) 
— 80 (A), Hence /So(a) is an invariant of / with respect to every 
transformation Tjo, This completes the proof of Theorem 10. 

By TkiX — ry — ^ — (r — k)ri. Hence each root Vi of f(x/y, 1 ) 
= 0 is diminished by A. Thus the difference of any two roots is 
unaltered. 
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Since Ai = 0 in (15) if we take h — ~ ai/ao, we see that the 
resulting transformation Tk replaces / by the reduced form 


gf = tto P + (2)^*2 + (3)631”"® 17® + ... 1 

where 


62 = 02 — OiVooj 63 = 03 — 3ai 02/00 + 2oi®/oo®. 


The roots of g = 0 are 

Ti + Oi/Oo = [(r< - ri) +-!-(»'<- rp)]/p 

(i = l,...,p). 

Hence each root of g == 0 is a linear function of the differences of 
the roots of / = 0 and thus is unaltered by every transformation 
Tk. The latter is true also of 62/00, 63/00, ..., which are equal to 
numerical multiples of the elementary symmetric functions of the 
roots of g = 0. Hence the homogeneous isobatic polynomials . 


( 16 ) i 


= fflo 62 “ ®0 U2 — ®1®> 

53 ~ Uo® &8 ~ ®0® Us — 3 Uo ®1 Us “I" 

54 = Uo® 64 = Uo® U4 — 4 fflo® Ui Os + 6uo Ui® 02 — Sui^ 


etc., to Sp are invariants of / with respect to all transformations 
Tk, and hence are seminvariants of /. 

Since g was derived from / by a linear transformation of deter¬ 
minant unity, every seminvariant S of f has the property 

S(ao,..., Op) = (S(uo, 0 , 62,..., hp) 

= S(ao, 0, Si/ao ,..., S’p/uo®’"^). 

Thbobem 11. Every semirwarimt of f is the quotient of a poly¬ 
nomial in the seminvariants ao. Si,Sp by a power of Uo. 

We shall presently apply this theorem to obtain all semin- 
variants when p < 5. Among them occur the leaders of all 
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covariants by Tbeorem 10. Our next problem will then be to find 
all covaidants K having a given leader 8, As an aid to its solution, 
we now prove 

Theorem 12. If S is a seminvariant of degree d and weight w of 
a Unary form f of order p, any covariant K of f having the. leader S is 
of known order n ^ pd ~~ 2Wi and at most one such K exists} 

By Theorem 9, the weight of aS is the index I of K, By Theorem 
7 with r = 1, ^ — 2, we have pd — n — 2L Hence the order n of 
K is uniquely determined by S. 

If there exist two distinct covariants /S a;" + • • * of /, they have 
the same index I = w, whence their difference is a covariant of / 
having the factor y. It is of the form yq{ao ,.,., a;, y). Let 

rf be any chosen linear function of x and y and select another such 
function f which is independent of rj. In view of the transformar 
tion of determinant A which expresses x and y linearly in terms of 
S and 7/, we have 

77g(Ao, i?) = A^yq(aQ,, .., ap,* aj, 2/). 

Hence yq is of order n and has as a factor every 17. This contra¬ 
diction completes the proof of Theorem 12. 

CoROij:iART. The weigh! of an invariant of degree d of a Unary 
form of order p is ^pd^ 


11. Theorem 13. Every Unary linear transformation is a product 
of transformations of the three types 


r„: 

i + nn, 

II 

S„: 

11 

y = mr](m 9^ 0); 

V: 

1 

li 

II 


From these we obtain “P = a; = 77, y = — f, and 

*That one such K always exists is proved in §16. 
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Rn - V-^ 7: x^X, y^Y + nX; 

Pm == 7”^ SmV: X — mXf y — Y (m 5 *^ 0). 

Then x — y + drjj D == ad — be ^ 0, is equal to 

Sd Pj>/d TbdfuRc/d (d 7^ 0 ), Sc P^b T^a/b 7 0 = 0 ), 

Applying the corollary to Theorem 9, and noting the effect of 
transformation 7 of determinant unity, we obtain the 

CoROLLAEY. A polynomial in ao,. .., x, y which is homo¬ 
geneous in X and y is a covariant of the p-ic (11) if and only if 
it is isobaric, is covariant with respect to every Tn, and is un¬ 
altered when X is replaced by p, 2/ by — x, and a* by (— 1 )* 
for i = 0, 1 ,..., p. Hence a seminvariant which is unaltered by 
the last replacements is an invariant. 

12 . Fundamental system of covariants of the binary p-ic for 
p < S. We shall exhibit a finite number of polynomial covariants 
jSli, ..., iiTn of the binary p-ic f such that every polynomial 
covariant of / is expressible as a polynomial in , Kn with 

numerical coeflicients. Then Kij .,. ,Kn are said to form afuvr 
damental system of covariants of /. 

We shall first exhibit certain seminvariants of / such that every 
seminvariant is a polynomial in them. Each of them will be seen 
to be the leader of a known covariant of /. Our problem is thereby 
solved, since no seminvariant is the leader of two covariants by 
Theorem 12. 

For p = 1, Theorem 11 shows that every seminvariant of 
/ = ao ic + 2 / is a polynomial in ao- Since ao is the leader of the 

covariant / of /, we have 

Theorem 14. Every polynomial covariant of f = aoX + a^y is 
a polynomial inf. Thus f itself constitutes the fundamental system of 
covariants of /. 
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For V — 2, Theorem 11 shows that every seminvariant is the 
quotient of a polynomial P (oo, Si) by ao*. But S 2 is not divisible 
by Oo. Hence P is divisible by Oo only when all of its terms have 
the explicit factor Oo- Cancelling the factor Oo from both P and 
Oo®, we reduce the problem to a like one having e — 1 in place of 
e, and finally to the case e = 0. Hence every seminvariant is a 
polynomial in oo and Si. Since oo is the leader of tibe covariant 
f of /, while Sj is an invariant of / by §4, we have 

Thboubm 15. Ev^ polynomial covariani of iJie Unary quadraUc 
form f is a polynomial inf and Us discriminani Si. 

A different treatment is necessary when p > 2 since there are 
seminvariants which are not polynomials in ao and the S{. Write 
Si for the terms of free of Oo; by (16) 

(17) S 2 = - oi^, Sa = 2 oi«, S 4 = - Soi*. 

For p = 3j 45«* + Ss* s 0 . We find that 

(18) W + Ss^ = ao^D, 
where, in accord with ( 10 ) and ( 12 ), 

(19) D = ao®03*- 6 ooaiaaas + 4ooa»* + 4oi*aj -• 3oi*d»* 

is the discriminant of the cubic form/. By means of (18) we may 
eliminate S 3 * and higher powers of So from any polynomial in 
Oo, &, Si and conclude from Theorem 11 that every smuinvariant 
is of the form P/oo", where P is a polynomial in ao, Si, St, D of 
d^ree 0 or 1 in S 3 . If e = 0, P is a seminvariant. Henceforth 
let e 2 1. bn case every term of P has the explicit factor ao, we 
caned it from P and ao* and hence reduce the probl^ to a liTra 
one having e — 1 in place of e. It remains only to treat the case in 
which not every term of P has the explicit factor ao. Then the 
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expression for P in terms of ao, ai, (h, as is not divisible by a© 
(i,e., P does not have the implicit factor ao). For, if so, then 

P' s P(0, Ss, Ss, d) s 0, d = 4ai« as - Sai^ OzK 

Since as occurs in d, but in neither Ds nor Ss, we conclude that P' 
is free of d. The first power of Ss == 2 ax® is not cancelled by a poly¬ 
nomial in S 2 ~ — ai^. Hence P' is free also of Us and therefore 
of 22 . 

Hence every seminvariant of the cubic form/is a polynomial in 
ao, S 2 , Szj D. They are connected by the syzygy (18), i.e., a rela¬ 
tion not serving to express any one of the four as a polynomial in 
the remaining three. 

Theorem 16. A fundamental system of polynomial covaiiants 
of the binary cubic form f is given by fj its discriminant D, its Hessian 
36H, and the Jacobian 3G of f and H. They are connected by the 

syzygy 

( 20 ) + 

This follows since ao, S 2 , Sz are leaders of the covariants/, H, (?. 

For p = 4, we see from (17) that the simplest relation is 
322^ + 24 = 0 . We find that 

Si -|“ 3;S2^ ^ ao^ If I = ao Oi — 4ai as + 302^. 

We employ this identity to eliminate Si. Consider polynomials 
P(ao, S2, Sz, I) having ao as an implicit, but not explicit, factor. 
Such a polynomial is given by (18). For ao = 0, D = —ai^I 
“ 22 1; we find that 


S2I -- D ^ aoJj 

J ao 02 a4 — ao os^ + 2ai 02 os — ai^ a4 — 02®. 
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Elimination of D between this identity and (18) gives 
(21) oo® J - Si I + W + Si^ s 0. 

The latter enables us to eliminate Sa® and higher powers of Si 
from P to obtain a polynomial Q in ao, Si, Ss, I, J of degree 0 or 1 
in Si. If possible, let Q have Oo as an implicit, but not explicit, 
factor. Then 


$(0, — ap, 2ai®, Soa® — 4ai aa, J) = 0." 

In view of the term — Oi® 04 of Q cannot involve J, and similarly 
not I. Nor can Q be of degree 1 in Ns in view of the odd power Ui®. 
Hence Q is free also of Sa and therefore of Si. 

Hence every seminvariant of the quartic / is a polynomial in 
do, Si, Si, I, J. They are connected by the sy^gy (21). 

Denote the Hessian of f by 144H, and the Jacobian of / and H 
by 8G. Then ao, Si, and Sa are the leaders of the covariants /, H 
and G of/. By the corollary in §11, the seminvariants I and J are 
invariants of/, since they are unaltered when we replace ao by cn, 
<h by Oo, Oi by — Os, and as by — Oi, 

Thboebm 17. A fundamental system of polynomial covariants 
of the Unary guarUc form f is given iy f, Us invariants I and J, its 
Hessian 14AH, and the JaccMan 8G of f and H. They are connected 
hy the syzygy 

(22) fj -fHI + 4m + G^^0. 

13. Canonical form of a binary quartic form f; solution of 
quartic equations. We shall exclude the uninteresting case in 
which fix/y, 1) = 0 has a multiple root. Since there are four 
distinct roots, /(a:, 2/) is a product of four linear forms, no two of 
which have proportional coefficients. Taking any two of them as 
new variables J and ij, we may reduce / by a linear transformation 
to the form ^yq, where 
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Q = 9^ + 2h^ri + kri^. 

Then gk 9 ^ 0 since neither | nor 1 ? is a factor of q. Choose r, s, t 
so that 

^ = k/g, r + s = g, r — s = h/t. 

Then 

h + tvy - 

q = r(i + tvy + sii-tr,y. 

Introducing the new variables 

we see that / = takes the canonical form 

(23) Z^+ 74 + 6mZ2p. 

The presence of the arbitrary constant m is explained by the 
existence of a rational absolute invariant i.e., one of index 
zero, which therefore has the same value for / as for any form, like 
(23), derived from / by linear transformation. For, the invariants 
I and J are of weights (and hence indices) 4 and 6, so that P/P is 
of index zero. 

In the preceding proof of the existence of a canonical form (23), 
we made use of the roots olf{x/y^ 1) = 0. We shall now show how 
to obtain the canonical form without assuming that the roots are 
known. We have proved the existence of a number m and a linear 
transformation T of determinant A 5*^ 0 which expresses X and Y 
linearly in terms of x and such that (23) becomes the given 
quartic / under this transformation T. To compute the values of 
1 and J jfor (23), take ao = 1, - 0, 02 = m, as = 0, a4 = 1; 
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then I and J have the values 1 + 3m* and m — m® respectively. 
Since I and J have the indices 4 and 6, we get 

/ = P + 6mX*r*, 

I = A*(l + 3m*), J = A®(m — m*), 

where I and J refer to /. The Hessian of (23) is 

^JP + mF* 2 mXY 

|2mZF F* + mZ* ■ 

Hence if the Hessian of/ is denoted by 144ff, a covariant of index 
2 , we have 

H A*{m{X* + F*) + (1 - 3m*)Je* F*}. 

We first compute a root A* m of the cubic equation 

(24) 4(A* m)» - /(A* m) + J = 0. 

Then A* = I — 3 (A* m)*. Choose either square root as A*. From 
the value of A* m, we get the value of m. Eliminating 4- F* 
between the expressions for / and H, we get 

= A*(9m* - 1)X* F*. 

This gives XF by extracting the square root of a form which is a 
perfect square. For, if 9m* = 1,/ would be the square of Z* ± P, 
contrary to the hsTpothesis that there is no multiple root. From 
XY and /, we get P + P and then X ± F and hence X and F. 

Thbobbm 18. Every Unary guartic form f (x, y) not divisible by 
the square of a linear farm can he redvaed to a canonical form (23). 
We can find m and ihe linear functions X and Y of x and y by finding 
a root rof a cubic equation (24), whose coefficients are Uie invariants 
1 and J of f, and by extracting the square root of rf — H, where 144H 
is ^ Hessian of f. 
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We can express (23) as a product of linear factors by solving 

+ 1 + 6mz = 0, where z = X^JYK 

Theoeem 19. To solve any guartic equation q{w) ^ 0 not having 
a multiple root, reduce the qmrtic form f(x, y) = y^q{x/y) to its 
canonical form (23) and express it as a product of linear factors hy 
employing square roots of knoum numbers. To solve the auxiliary 
cubic equation (24), reduce the corresponding cubic form to one 
involving only the cubes of the linear factors of the Hessian of that 
cubic form ( §7). Hence we can solve any guartic equation by extractr 
ing a cube root and certain square roots of numbers determined by the 
invariants I and J and the Hessian of the corresponding guartic 
formf. 

For tables of invariants and covariants, see FaSi di Bruno, 
Theorie der Binaren Formen, Leipzig, 1881. 


Exercises • 

- 1. If / = 0, A 0, then Sm® - — 1. Show that 7 = 0 for the quartic 
— Sw^ — iTw — = 0 (which arises in the theory of inflexion points 

on a plane cubic curve). Find its quadratic factors. 

2. Find the three canonical quartios when / = 0. Apply to solve 
± cw - 1 ~ 0. 



Chapter II 

FmTHER THEORY OF COVARIANTS OP BINARY FORMS 

We shall here prove some fundamental theorems on invariants 
and covaiiants which were merely verified in the introductory 
chapter for binary forms of orders ^ 4. For example, we shall 
prove that all covariants are expressible as polynomials in a finite 
number of covariants. Suggestions for supplementary reading are 
made at the end of the chapter. 

14. Annihilators of covariants. Consider a binary form 

( 1 ) f = 'E,Q)aiXf-*y\ 

having prefixed binomial coefficients, as in §9. Under the trans¬ 
formation 

Tn. a: = 1 + y = V, 

let/become 

( 2 ) 

where the Ai are polynomials in oo,..., whose actual ex¬ 
pressions will not be needed. Differentiating (2) with respect to 
n, we get 

since ri = y is free of n, while % = x — ny. The total coefficient of 
ijf is 


24 
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in which the second term is to be suppressed whenj = 0. Since 


G-) = O-i)' 


we evidently get 
dAo 


(3) 


dA 


OJi. j , . 

= ^ = 3^i-l 0 = 1; • • • » p). 


dn dn 

Consider the polynomials 
k = K(ao,...,ap-,x,y), 
By (3), 


(^Aof • • • ^ Apj 


(4) 


dK ^SKdAi , dKd^ A., dK dK 

a*. ^ yl ' a f- ^ A ^ 


dn %dAidn di dn 


j-i 


dAi 




The polynomial h is said to be a covariant of (1) with respect 
to every transformation r„ (of determinant unity) if and only if 
K ^ k becomes an identity in n, x, y,aQ,, % after f is replaced 
by a? — ny, rj by y, and each Ai is replaced by its expression in 
terms of n, Ooj..., Op. Since k is free of n, K == k implies that 

?-»■ 

Conversely, the latter implies that K is independent of n and hence 
is equal to the value k to which it reduces when n = 0, since 
Tn is then the identity transformation x = y ^ ri, which re¬ 
places (1) by a form (2) with the same coefficients Ai = ai. The 
condition is therefore that the final expression in (4) be zero 
identically in n, x, y, ao,... 7 ctp. In particular, it must be zero 
when w = 0, whence 


rt 

Oi-y-.O, 


7-1 oaf 


(5) 
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identically m x, , ctp. Ftom this identity we evidently 

obtain an identity when we replace a;,..., Op by any quantities 
such as 7?, Ao,. -., Ap. Thus the final expression (4) is zero 
identically in the latter quantities. Hence Jk is a covariant of (1) 
with respect to every transformation Ynif and only if (5) holds. 
We shall then say that k is annihilated by the differential operator 

( 6 ) + 

The form (1) is unaltered if we interchange x and y, and inter¬ 
change a,- and Op-t for i = 0, 1,..., p. Hence & is a covariant of 
(1) with respect to every transformation 

x= y = 1 ) + nf, 

if and only if & is annihilated by the operator 




m 






+ Op 


—)■ 


By the corollary in §9, fc is a covariant of (1) with respect to 
every transformation Sm* ic = y = if and only if ft is 
isobaric. Since T^i jBi = Y, viz., x ^ — Y^y — X, it follows 
from §11 that every binary transformation is generated by the 

Yti, 


Theorem 1. A polynomial K^Oq, • • • > Qp; is a covariant 
of the binary form (1) if and only if it is isobaric and is annihilated^ 
by the two operators (6) and (7). 

Corollary 1. A polynomial in Oo,..., is an invariant of 
(1) if and only if it is isobaric and is annihilated by 12 and 0. 


^In compu^tions, it is simpler to employ the sin^e aimihilator (6). We apply 
the corollary in §11 with condition concerning T* replaced by the equivalent con¬ 
dition that the polynomial be annihilated by (6). 
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For example, the weight of an invariant I of degree 2 of Oo £C® 
+ 2oi icy + 021 /* is 2 by the corollary at the end of §10. Hence 
7 = roo 02 + soi*. Then 07 = 2(r + 8)00 oi s 0, 07 = 2(r+ s)- 
Oi 02 — 0. Either condition gives s — — r, I = r (oo 02 — a^). 

By the definition of seminvariant in §10, we have 

CoBOLLABT 2. A homogeneous, isobaric poljmomial in oo,..., 
Oip is a seminvariant of (1) if and only if it is annihilated by 0. 


15. Commutators. Express each of the annihilators 0 and 0 
of invariants in the following two forms: 


V 


7-1 


daj 


fc-o oaAs+i 


Then 


0 = E(p - 7 + l)aj = Z(p - *) tti+i 

,•-1 c7aj*_i 


00 = £io,_i[(p - j +1) +Z(p - feK+i . ]> 
00 = - *)o*+i[(* + 1 )t^ +Ziaj-1 ]• 

■- act*!-!-! Oflfc 


The terms involving second derivatives are identical. Thus 
00 — 00 involves only first derivatives and is called the com- 
mvitator (alternant) of 0 with 0. In the first terms of 00, write 
j = i + 1; iu those of 00, write = i — 1. Hence 

00 — 00= ^(i + l)a<(p — -— »' + 


= Z(p - 2i)a<—• 

im>Q OCti 
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If jS is a homogeneous function of oo,..., of (total) degree d 
and hence is a sum of terms of type 

t = cflo^o ai®! • • • {eQ + ei+ • • • + Cp = d), 

we readily verify Euler's theorem (cf. Ex. 3, §5): 

Ui — ^ dS. 
dai 

li Sis isobaric, it is a sum of terms t in which 

ei + 202 + • ‘ + pep ^ w — constant. 

Then 


p 


p 0^ p 

23 “ = 2 

i-o i-o 


'f. a-s 
la,- — s wS. 

i_o 3a< 


Thbosem 2. 7/ )S is homogeneous (of degree d) and isobaric 
(of weight w) in oo,, Op, then 

(8) (S20 — OQ)iS = nS, n = pd — 2w. 


This result is the case r = 1 of 


(9) (QO’’ — O'" Q)S s r(n — r + 8 . 

To prove the latter by induction on r, assume that it holds when 
r = k, and note that 08 is of degree d and weight lo + 1, so that 
when (9) is employed with 8 replaced by 08, n must,be replaced 
byn — 2. Hence 

(QO*+i - Q)8 s (flO* - 0* Q)08 + 0*(120 - 0 Q)8 

= k(n-2-k + l)0’‘-^08 + n0^8 
= (k + l)(n - k)0>‘ 8, 

and (9) holds for r = ft + !• .The induction is therefore complete. 
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Similarly, since OS is of degree d and weight w — 1, 

(10) (00’' - Q' OyS = r(- n - r + S. 

16. Existence of a covariant with a given seminvariant leader. 
In §12 we verified for p g 4 that every seminvariant of the binary 
form f of order p is the leader of a covariant of /. We shall now 
prove that a like result holds for any p. 

Lemma. If 8 is a seminvariant, not identically zero, of degree d 
and weight w of a binary p-ic, then pd — 2w ^ 0. 

Suppose on the contrary that S is a seminvariant for which 
n = pd — 2w ia negative. Since QS = 0, (9) gives 

(11) Q0^S = r(»-r + l)0’^i/S (r = l,2,....) 

and no one of the coefficients on the right is zero. We recall that 
the weight of OP exceeds that of P by unity. The maximum 
weight of a polynomial of degree d in Oo,..., Op is pd, which is 
the weight of ap^. Since 0’**"“+’- 8 is of degree d and weight pd 
+ 1, it is zero identically. Thus (11) for r = pd — w + 1 gives 
Qpd~v) g s 0 ^ Then (11) for r = pd — w gives s 0. 

Proceeding similarly, we get 08 = 0 and then 8 = 0, contrary to 
hypothesis. 

By Theorem 12 of §10, any covariant with the leader is of 
order n — pd — 2w. Give it the notation 

\ 

K = Sa” + 8i y + 82 p® + • • • + p“. 

The result of appl 3 dng the operator (7) to X is 
(08 - fifi)a:» + (O81 - 282)x«-^y 

+ -h (08„-i — n8n)xy”-^ + OSn p", 

which is zero identically in x and p if and only if 

(12) K = 8x” + 08x^-^ p + 10* /Sir"-® p® +-h 0" 8y”, 

15560 lISc Lib B’lore 

512,74 N26 
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and 0”+^ /S s 0. To show that the last condition is satisfied, note 
that (11) for r = n + 1 gives S ^ 0, Since 0^*+^ S is of 
degree d and weight = + n + — tt? + l and is 

annihilated by 12, it is a seminvariant by corollary 2 of §14. It is 
zero identically by the lemma since pd — 2TV = — {pd — 2iy) 
— 2 is negative. Hence (12) is annihilated by the operator (7). 

In the result of applying operator (6) to (12), the coefficient 
of is 

which is identically zero by (11). Hence (12) is a covariant by 
Theorem 1. 

Thboebm 3. There exists a unique covariant of a binary phic 
whose leader is any given seminvariant of the pnic, 

17* Hilbert’s theorem 4. Any set S of forms in Xi, _ fXn conr 

tains a finite number of forms Fi,.., ,Fk such that every form of the ‘ 
set can be expressed as a linear combination of Fij... ,Fk with 
coefficients which are forms in Xi,, Xn, but are not necessarily in 
the set S. 

For n = 1, 5 is composed of certain forms ci a;«i, cz x% _ 

Let Cm be one of the minimum exponents and take Fi = Cm x^m. 
Every form of the set 8 is the product of Fi by a factor of type kx*^, 
where c § 0, Hence the theorem holds with ft = 1 when n — 1. 

To proceed by induction on n, let the theorem hold for every set 
of forms in 71 1 variables. To prove it for the set £», we may 

assume that 8 contains a form Fq of (total) order r in which the 
coefficient of Xn^ is not zero. For, let/(a;i,..., a;„) be a form in 8 
which actually involves Xn. Then there exist numbers Ci,..., c„ 
(Cn 9^ 0) for which c = /(ci,- Cn) is not zero. The trans¬ 

formation 


iCf = 2/i + Ci (i = ^ n 1 )^ Xn = Cn yn 

t 
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is of determinaait c„ 5^ 0 and replaces/by a form Fo (yi,... ,yn) 
in which the coefficient of j/n*' is evidently c. 

Let F be any form of the set S. Dividing F by Fo, we get 
F ^ Fo P + R, where jffi is a form whose order in a:„ is < r. In 
B we segregate the terms whose order in is ejcactly r — 1 and 
have 

F = FoP + Mxn^-^ + N, 

where Jkf is a fonn in aii,..., Xn-i, while JV is a form in ..., 
whose order in avi is ^ r — 2. Each F uniquely determines an M. 

Since the theorem is true by hypothesis for the set of foims M, 
there exist a finite number Mi,... ,Mi of them (corresponding 
to certain fojms Fi,... ,Fi oi S), such that every M can be ex¬ 
pressed as 

M =fi Ml + - hfiMi, 

where the /j are forms in Si,..., Then 

F^FoP + N + x^'-^'ZfiMi, Fi = Fo Pi + Mi Sn-i -|- Ni, 

i-l 

whence 

F^FoP'+T,fiFi + B’, P' = P-'£fiPi, 

R' = N-Y,fiNi. 

Each exponent of a:„ in 22' is ^ r — 2 ; we segregate its terms for 
which that exponent is exactly r — 2 and have 

F = FoP'+'tfiFi + M' + iV', 

where ilf' is a form in Xi,.,,, Xn^i, and iNT' is a form iaxi,..,, Xn 
whose order in a;„ is g r — 3. 

Since the theorem holds for the set of forms M', each is a linear 
combination of ikf'i,..., Mm, corresponding to certain forms 
Fi+i ,..., Fz+w of S, As before, F differs from a linear combinar 
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tion of Fo,..., Fi+m by M” + JV", where M" is a form in 
»i,..., Sn-ij and N" is a form whose order in a:„ is ^ r — 4. 
Proceeding similarly, we see that F differs from a linear combina¬ 
tion of certain forms Fo, •. •, Ft of S by a form lacking Xn- One 
more step leads to the theorem. 

18. Finiteness of a fundamental system of covariants. For 
simplicity, we first consider only invariants of a single binary 
form / with the coefficients ao,..., a,. By Theorem 8 of §8, 
every invariant of / is homogeneous in the o’s. By the preceding 
theorem, there is a finite number of these invariants Ji,. • . , fm 
in terms of which every invariant J of F is expressible linearly: 

(13) I ^ Fi7i-f •••-!- 

Let I and J# be of degrees d and d/, respectively. By the corollary 
at the end of §10, their wei^ts are |pd and fpd/. Hence Ej is 
of degree Dj — d — dj and weight 
We shall next obtain a differential operator D such that DEj 
is an invariant. Apply the operator 0^ to (10); we get 

0^10'-iS - O’--* Q'-OiS s r(- n-r + 1)0-^* 

In the second term replace QOS by its value OQS + n8 from 
(8). Thus 

Qr-l Qr 5 s Or -2 Qr-lf^Q _ (y _ 1) + r)]^(. 

Since 008 has the same degree and wei^t as 8 , we may employ 
the formula derived from this by repladng r by r — 1 to eliminate 
0»-® and by repetitions of this process evidently obtain 

0'-^ Q'Ss 0[0Q - (n -f- 2)][00 -2(n + 3)] 

•••[OQ- (/•-l)(n + r)]^, 

which we may ako establish by induction. Let S be of degree d 
and weight uo = so that its n in (8) is zero. Take r = w -{■ 1, 
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apply 0“+^ S s 0, and divide by (— 2) (— 2 • 3) ...; we get 
0 = QDS, where 


(14) 



ri_ 00 1 

1-2Jl 2-3-1 L 


Since DS is annihilated by 0 and is of constant degree d and 
constant weight w = ipd, it is a seminvariant by Corollary 2 of 
§14. Being the leader of a covariant of order n = 0, it is an 
invariant. Since the invariants I and 1/ are annihilated by both 
Q and 0, they are unaltered by the operator D. These results 
hold also if we annex factors g = 1 — cOQ at the right of (14), 
since gS has the sanae degree and weight as 8 . Hence if we take 
w to be the naaxinaum of the weights of the Ej, and operate on 
(13) by D, we get 


I ^ Jili+ - b Jmlm, Jf = DEi = invariant. 

Since each is of the foma (13), 

m w 

Ih} I = BjJi I j I 

i, 

Applying to the last an operator D with w sufficiently large, we get 

I^UL.khlk, 


where the Ljh are invariants of /. Since there is a reduction of 
degree at each step of the process, we ultimately obtain an expres¬ 
sion for I as a polynomial in Ii,.. •, Jm with numerical coefficients. 
Hence h,... ,Im form a fundamental system of invariants of f. 


Exercise 

Extend the finiteness proof to invariants of a system of binary forms 
oqX^i + ... j 6oajP 2 + ..., Co(c^'s + .,., ... iirst prove as in §14 that 
an invariant is annihilated by 



34 


COVAEIANTS OF BINARY FORMS 


[Ch.n 


and by 2) 0. Next, if iSi is a polynomial in the at, which is of constant 
degree di in the a's, of constant degree tig in the h’s,..., and of total weight w 
inthea’s, Vs, ... taJren collectively, show that (8) and (10) hold if we replace 
O by SO, tf by (S Q)’’, and« by Sp»d< ~ 2tfl. Make the like replace- 
naentsin (14). 

The finiteness proof is extended to covariants by means of the 

T.TiiMiwA. The set of (M homogeneotis covariants of the Unary 
forms /i,is ideniical with the set of forms derived from the 
invariants I Qiomogmeous in X, Y) of /i, •.., fh and L ^Yx 
— Xy ly replacing X by z and Yhy yin each I. 

To simplify the notations, let a,b,... denote the coefficients of 
f\,... ,fh arranged in any chosen order. Let A, B,... denote the 
corresponding coefficients of the forms obtained by the transfor¬ 
mation 

X = + gv, y = ru + so, D — ps — ^ ^0. 

This replaces L by Fm — Uv, in which 

V = sX-qY, V = - rX + pY. 

Solving these two equations, we get 

DZ = pC7 + qV, BY = rV + s7. 

Let I = I (a, b,X, Y) be of index 1. Then 
I(A, B,...-,U,V) = D^I = D*-”I(o, b,...,DX, BY), 
since 7 is of constant order n in X, F. Hence 
I{A,B,...;U,V) 

= j)i-n i{a,b,...; pV + qV, rXJ + sF), 


identically in U, V. We may replace the latter by u, v and get 
1(4, B,... ;u,v) = 7)*-» I(a,b,...;x, y). 
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This proves that I(a,b,... ;x, y) is a covariant of /i,..., /* of 
order n and index I — n. The argument may be reversed. 

Since every'covariant is a sum of covariants each homogeneous 
in the variables ( §8), we have completed the proof of 

THBOBiai 5. There exists a fundamental system of covariants of 
every set of binary form. 

The first proof of this classic theorem was made by Gordan by 
use of the symbolic notation. Cayley had earlier come to the con¬ 
trary conclusion that the fundamental system for a binary quintic 
is infinite, due to an error relating to the independence of the 
syzygies between the covariants. For a binary form of order <5, 
a finite fundamental system of covariants was constructed in §12. 

19. Finiteness of sy^gies. Let Ii ,be a fundamental 
system of invariants of the binary forms fi,. ■ . ,/r. Let 
S(zi,..., Zm) be a polynomial with numerical coefficients such 
that S(Ii,..., Im) is zero identically in the coefficients of the /’s. 
Then S{Ii,..., !„) = 0 is a syzygy between the invariants (see 
the examples in §12). 

By means of a new variable Zm+i, we may convert (zi,..., Zm) 
into a homogeneous form H by multiplying a suitable power of 
Zm+i into all terms not of the maximum degree. By Hilbert’s 
theorem, the forms H are all expressible linearly in terms of a 
finite number Hi ,..., H*, of them. Take Zm+i = 1. Thus 

(16) S = Pi^i-|- • • •Pj 

where Pi,..., P*, are poljmomials in zi,..., Zm. In this identity 
in «i,..., Zm, we may take zi = Ji,..., z* = /„. 

Thboebm 6. There is a finite number of syzygies Si == 0 ,..., 
Sk <= 0, such that every syzygy 5 = 0 implies a rdation (15) in 
which Pi ,... ,Pk are now invariants. 

In particular, every syzygy is a consequence of Si = 0,..., 
Sh ~ 0. By the Lemma in §18, the proof holds also for syzygies 
between covariants. 
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20. Canonical fom of a binary form of odd order. In §7 we 
saw that any binary cubic whose discriminant is not zero is the sum 
of the cubes of two linear forms which are the factors of the Hes¬ 
sian. We shall prove an analogous theorem for a form/ of any odd 
order 2n — 1. Let / have the notation (1) with p = 2n — 1. 
We seek constants pi, ri such that 

(16) / s Pi (a; -1- ri -f-- \rPn(p + r„ 

identically in x, y. The conditions are evidently 

(17) Oi = Pi ri< -1--H Pn (i = 0, 1,..., 2n — 1). 

We can deteimine solutions ffn, • • •, <?i of the n linear equations 

(18) a, Qn -f- O.+l -(-••• H- 0»+n-l Si + ®»+n = 0 

(s = 0, 1,1) 

if the determinant of their coeflScients is not zero. Let ri,..., 
be the roots of 

(19) z“ -I- Si z»-i -t--h ffn-i z + Sn = 0. 

If ri,..., r„ are distinct, conditions (17) with i = 0,1,,.., n — 1 
uniquely determine pi,..., pn. We shall prove that the remaining 
n conditions are then satisfied. To proceed by induction on s, we 
assume that the conditions (17) hold when i < a + n and prove 
that the condition with i = s + nia then satisfied. Multiply the 
condition with i = s by Sn, that with z = s 4-1 by q„-i ,..., that 
with z = s -f » — 1 by Sir that with z' = s 4* n by unity, and add. 
The corfdcient of p* is 

rt* s« 4- rk‘+^ Sn-i 4--i- Si + 

which is the product of r^* by the zero value of the polynomial 
(19) for z = rjj. The new left member is evidently (18). 

The equation having the roots n,... ,r„ is evidently obtained 
by eliminating Si,..., s» from (18) and (19). We prefer the 
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homogeneous equation in x and y whose linear factors arex + rty 
(t = 1,..., n). It is found by taking the determinant of the 
coefficients of gn,..., qi, 1 in 


/IQM 2/” Qn - r*"" 

V ^ yqi+ (— a?)" = 0 


and (18), and hence is 


yti 



... 

(-JC)* 

ao 


02 

• • • 

On 

ai 

02 

03 

On 

On4l 

an-l 

On 

On+1 

* • • 02n-2 

OSn-l 


This determinant is called the canonizant C of /. 



Theorem 7. Every sufficiently general Unary form of odd order 
2n — 1 is a sum of (2n — l)th 'powers of n linear forms. 


Exercises 

1. For n =* 3, the coefl&cient of x* in the canonizant O of the qnintic f is the 

invariant J of the quartio q — y + ..,, In particular, / is a 

seiuinvariant of q and hence is annihilated by the operator O in (6) vdth 
p =» 4. Since J lacks it is therefore annihilated by £2 with p = 5 and hence 
is a seminvariant of /. 

2. Show that J is a seminvariant of every / with p ^ 4 since it is a rational 

function of the seminvariants ao, See (21) of §12, and §10. 

3. There exists a covariant of order 3 of the qnintic / with the leader J. It 
coincides with C since the canonizant of the form derived from / by any linear 
transformation T has as linear factors the functions derived from the linear 
factors of C by T. 

4. Torn = 3, the canonizant C of / is the minor of p* in A: 


2/® 

0 

0 

0 


1 a? 0 0 

Oo 

Oo re 4- ai p 

oi aj + 02 p 

02 a; 4“ Os p 


0 p a? 0 

ax 

oix + a2P 

oa a; + 08 p 

03 a; 4- 04 p 

1 xy — 

0 0 p a; 

02 

02 a? + 08 p 

oa aj + 04 p 

04 a? 4“ 06 p 


0 0 0 p 


For, the product of C by D = p®, rows by columns, is A. 
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5. If a seminvanant S of tlie quintic / lacks ag, it is a seminyariant of the 
quartic q. If the wei^t of ^ is double its degree, S ism invariant of, q, 

6. Find the determinant whose vanishing is a necessary and suQ^cient con¬ 
dition that a binary form f of order 2n be a sum of the powers of n linear 
fornos. 

Hint: Employ (18) also for s =n, 

2L Ftirther topics on covariants. Reference will be made to 
elementary introductions in English to the following subjects: 

Seminvariants and covariants as functions of the roots.^ 

Symbolic notation; geometrical applications.^ 

Canonical form of the binary sextic and octavic.^ 

Functionally complete system of covariants of a binary form, 
found from a complete system of three linear partial differential 
equations,^ 

Theory of covariants under linear transformations with integral 
coefficients taken modulo p, a prime.® 

The coefficients of a unique canonical form are invariants.® 

Interesting applications of invariants of a quintic form are 
given in the elementary papers cited in §112, 

^Dickson, Algebraic Invariants, Wiley and Sons, 1914, pp. 53-68. 

^Ibid.f pp. 63—97. The Algebra of Invariants by Grace and Young is very difficult 
for students, except possibly the chapters on geometry. To readers familiar with 
German are recommended Gordan, Vorlesungen uber Invariantentheoiie, 1887; 
Clebsch-Linde mann , Vorlesungen Tiber Geometrie, 1,1876; II, 1891; and the up- 
t^ate book on Invariantentheorie by Weitzenb6ck, Groningen, 1923. The spe¬ 
cialist will need Meyer’s report on invariants in Jahreshericht der Deutschen Math.- 
Vereinigung, 1, 1890-2, 79-292. 

‘Elliott, Algebra of Quantics, 1896, 294-9; ed. 2, 1913, 285-90. 

^Dickson, Annds of Math., 25, 1924, 369-76. We may start with (18),'p. 373, 
noting that uf, »/, and wfsneOft G/» and (Q 0 — 0 0)/of our text. 

‘Dickson, On Invariants and the Theory of Numbers, the Madison Colloquium 
of the Amer. Math. Soc., 1914. Of. Glenn, Treatise on the Theory of Invariants, 
1915, 176-208. Dickson, History of the Theory of Numbers, III, 1923, 293-301, 

‘Wilczynsld, Proc. National Acad. Sciences, 4, 1918, 300-5. 



Chapter HE 

MATRICES, BnJNEAE FORMS, LINEAR EQUATIONS 

Chapters III-VI, which are independent of I-II, give a new 
exposition of the subject usually called hi^er algebra. We first 
develop Cayley’s calculus of matrices, and the essentially equivar 
lent subject of bilinear forms. The main theorems on the solu¬ 
tion of systems of linear equations are not presupposed, but are 
deduced as corollaries. 


22. Linear forms, matrices, linear transformations. Any homo¬ 
geneous linear function, such as 7® — Zy, is called a linear form. 
Consider m linear forms 


( 1 ) 


Uu + Oj2 J/S + • • • + <hn Vnt 


[ Zm = Uml yi + fflm2 ^2-1" ' ' * + Umn 

in the n variables yi, , yn- Write the coefficients of these suc¬ 
cessive linear forms in the successive rows of a table 


( 2 ) 


0>yi 

(Zl2 

• • CLin 

0>ml 

Oiw!^ • 

• • O/ffin 


Such a rectangular table composed of mn numbers an arranged in 
m rows with n numbers in each row is called a matrix with m rows 
and n columns, or briefly sum "X. n matrix. The mn numbers at, 
are called the elements of the matrix. We shall speak of A as the 
matrix of (the coefficients of) the linear forms (1). 

Examples of one-rowed matrices are the notations (x, y) and 
(x, y, z) for points in a plane and in space, the elements of these 
matrices being the coordinates of the points. 

39 
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Matrix (2) is said to be equal to the mXn matrix 


dll 

di2 

• • • din 

dml 

dm2 

• * * dfnn 


if and only if corresponding elements are equal: 

Uu = dll, ®ia “ dl2, * • • i Umn ^ dmn* 

We shall also write (1) and (2) in the compact notations 

n 

(1 ) If SS ^ ^ “ Ij • • • } 

/-I 

(20 A = (an) (z = 1,..., = 1,..., n), 

in which we exhibit the element a*,* which lies in the ith row and the 
jth column of the matrix (2). 

Suppose that the n variables yj are expressible linearly in terms 
of ^ new variables zi,.. ., z,: 

$ 

(3) Vi = S (i = Ij • • • I 

*-l 

Inserting these values into (10, we get 

n a » 

(4) 1% dii ^ih z* == 2 Zk (i = 1,..., m), 

j-l k^l Jb-1 

where 

« 

(5) yik “ 2-) jh (z Ij • • • ^ ^ Iz ^ 

Let B — (6,-fc) and P = (pu) denote the matrices of the linear 
forms (3) and (4) respectively: 
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6u ■ • • 6i, Pii • • • pi. 

(6) B = P = . 

hfU • ' * hjia Pml ‘ ' * Pm» 

We shall call P the product of and B, taken in that order, and 
shall write P = AB. In view of (5), we therefore have the 
following 

Bide of mvMplicaMon qf matrices: The element in the ith row 
and kth column of the product of a matrix A with n columns by a 
matrix B with n rows is the sum of the products of the successive 
elements of the ith row of A by the corresponding elements of the 
Ibth column of B. 

For example, 

/a _ /(ux + u/3 + bd\ 

\c d) \7 d) Vca 4-^7 C|8 + dd/ 

By a linear transformation we shall mean a system of n linear 
homogeneous equations 

n 

(7) Vi = 2 2* (i = 1, • • •, m), 

which express the variables J/i,..., linearly in terms of the 
same number of variables zi,... ,Zn. This transformation is said 
to have the matrix (6,*). Taking s = « in (3)-(6), we shall 
say that the transformation (7) replaces the system of linear forms 
(1) by the system of linear forms (4). This proves 

Thbobem 1. A linear transformation witJi the matrix B replaces 
a system of linear forms with the matix A by a system of linear forms 
with the matrix AB. 

When m = n, equations (!') define a linear tranrformation 

n 

= 2 Vi (i = 1,..., n), 


( 8 ) 





42 


MATEICES, BIUNEAE FORMS 


tch. m 


which expresses the variables Zi,..., Zn linearly in terms of the 
variables yi,..., Vn. We saw that the elimination of yi ,..., y™ 
between equations (10 (3) gSiVe (4) and (5), which, in our 

special case m ^ n = s, become 

Zf = Pik Zk} Vik ^ S (i, k == 1,. • • ) 71), 

jfc-i 

The resulting linear transformation, which expresses , in 

linearly in terms of 2 fi,..., s», and has the matrix AB^ is called 
the product of the linear transformation (8) with the matrix A by 
the linear transformation (7) with the matrix B, 

The one-to-one correspondence between linear transformations 
and square matrices is therefore preserved under multiplication. 
For this reason a linear transformation is often denoted by its 
matrix when the particular choice of the letters used to denote the 
variables is immaterial. 

The determinant \ ] having the same elements as the matrix 

of a linear transformation (8) is called the determinant of the trans-^ 
formation. Since the above rule of multiplication of square mar 
trices is one of the rules of multiplication of determinants, we con¬ 
clude that the determinant of the product of two linear transforTnations 
is equal to the product of their determinants. 

A linear transformation is called singular if its determinant is 
zero, otherwise nonrsingular. 

23. Inverse, identity, scalar, adjoint. Let a denote the deter¬ 
minant of transformation (8). If a 0, equations (8) have the 
unique solution 

(9) = 

where Akj denotes the cofactor of a*,- in a. Equations (9) define 
a linear transformation which expresses the linearly in terms of 
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the Va; it is called the inverse of transformation (8). Since it is 
immaterial what letters are used to denote variables, let us replace 
each Zi by 2 *. Then the product of (8) by its inverse 

(9') = (y = i, 

is evidently the identity transformation I defined by 

(10) j Zrt ~ 

The, inverse of (9) is evidently the initial transformation (8). 
Note that / = Si if « denotes 

(11) Ui == ivi,..., Wn = tVn. 

The product of (8) by in either order is 

n 

Z< = 2 Vi (i = 1, . .., n), 

i-l 

whose matrix will be denoted by either tA or At, being obtained 
by multiplying every element of A by t The matrix of (11) will 
be denoted by St and called a scalar matrix; its diagonal elements 
are t and all remaining elements are zero. In particular, the matrix 
of (10) is denoted by I and is called the identity matrix. Thus 
iSi == 7. Our results give 

(12) StA^ASt-- tA, lA^AI^ A. 

Hence in products we may replace a scalar matrix factor S t by the 
number t In particular, we may suppress a factor 7. 

A matrix all of whose elements are zero is called a zero matrix 
and designated by 0. By (12) with i = 0, we have OA = AO 
«= 0 for every matrix A. 
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The matrix of (9') is designated by 

All/a • • • 

(13) A-i = . 

Ai„/a A nn Ja 

and called the inverse of A. By the above results, 

(14) A-^A^L 
If we suppress the denominators a in (13), we get 

All * * * Anl 

(15) Adj..i= . 

Ain * * * Ann 

which is called the adjoint of matrix A. Hence the element A in 
the ith row and jth column of the adjoint of A is the cofactor of 
the element Uyi in the jth row and ith column of A. 

In case the determinant a of A is not zero, we have Adj. A 
= a A“^ by definition; then (14) implies 

(16) A(Adi. A) - aZ, (Adj. A)A = aL 

This result (16) holds true also when a = 0, as shown by con¬ 
sideration of continuity or by direct multiplication. 

24* Associative and distributive laws for matrices. Let A 
= (a*/), B = ( 6 i,), C = (cij) be any mXn, nX s, sXt 
matrices, respectively. By (5), the element [ik] in the ith row 
and fcth column of the product AB is X]? By the same 

rule, the element [il] of {AB)C is 

(17) 2(2 <^t3 

*-i ^ 3-1 

The element [jl] of BC is 2* Hence the element [il] of 

A(BC)is 
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Since this is equal to the sum (17), we have 

Theoeem 2. MtUMpUcaUon of mcdnces is assodative,^ i.e., 

(18) (AB)C = A(BC). 

Either of these products may therefore he denoted by ABC 
without ambiguity. It follows that ABC • D, AB • CD and 
A • BCD are equal and may be denoted by ABCD. Similarly 
for the product of any number of matrices taken in a prescribed 
order. The case of equal factors shows that A" is uniqudy defined 
when the exponent e is any positive integer. 

The sum (or difference) of two mXn matrices is defined to be 
the m X matrix each of whose elements is the sum (or differ¬ 
ence) of the corresponding elements of the given matrices. For 
example, 

/a i\ , fa p\ fa ± a b± j8\ 

\c d/ ~ \y df 'C ± y d + d) 

It follows at once that 

ril + B = B + il, (J. + 5) + C' = A+(5 + C), 
tB = t(A -f- B)f tA 4* utA **({”[" u)A, 

The dement [i&] of A (B C) is which is 

the sum of the elements [iA;] of AB and AC. In this manner, we 
obtain 

Theoeem 3. MidtiplicaHon of matrices is distributive, i.e,, 

{20) A {B±C) = AB± AC, (B ± C)ABA ± CA. 

^Another proof follows from §1. 
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In view of the associative law, the inverse of any prodwt of mor 
trices is the product of their inverses taken in reverse ord&r. For 
example, 


ABC • <7-15-14-1 = A • 55-1 . 4-1 ^ 44-1 ^ 

If 4 is non-singular, AX = 5 has the unique' solution 
X = 4 - 15 , while F4 = 5 has the solution Y = 54- 1 ! Since 
matrix multiplication is not commutative in general, X and Y are 
usually distinct, so that there are two kinds of division of 5 by 4. 


Exsbcises 


1. Scalar matrices are the only ones commutative with every n Xn matrix. 

2. Show that is the only matrix whose products with A in both orders 
gives the identity matrix. 

3. By multiplication of the matrices 


(01)* * “ (^ “ (-J i)’ (v^V^)’ 




verify thatfc. By the associative law, 

ifc as a - vzsijj kj) = i, 

ji ^Idi ^ h, jk — = i. 


Any linear combination of J, i, j, k with numerical coefficients is called a 
quaternion. The two quaternions 

g a/ + + q? + q' — al — bi — cj dk 

are called conjugates. Verify that 


q = 




a: = a + 6 V — 1 , 

2 / = c + d 1, 


and that gg' = g'g =* nl, where n =xx+^ la called the norm of g and 
written n(g). 
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4. The conjugate of the product of two quaternions is equal to the product 
of their conjugates taken in reverse order. For, . 



where r = xw — j/u, s = aw + 2 (w. To find the norm of gp, note that 
qp-p'q^ ^q*n(p)I ^qq^^n{v)I ^n{q)n{v)I. 

Hence the norm of a product is the product of the norms of the factors. 

5. In matrices (2) and (6), let the an, ha, pn be themselves matrices, such 
that those appearing in any row have the same number of rows, while those 
appearing in any column have the same number of columns. Prove that (6) 
holds also here. If each hn = 0 when i 9 ^ 3 , find the two products of (oi/) 
and (Jbij). 


25. Characteristic equation. Let A be an 72r-rowed square 
matrix whose elements are independent of the variable X. Let I 
be the ?2r-rowed identity matrix. The matrix A. — XZ is called the 
characteristic matrix of A; it may be obtained by subtracting X 
from each diagonal element of A, The characteristic determinant 
of A is the determinant of A — XJ and is a polynomial in X of 
degree ni 

(21) ^(X) s On X” + an-i X”-^ + • * * + aiK + cto, = (—1)”. 


The equation 0(X) — 0 is called the characteristic equation of A. 
For example, 


( 22 ) 

<^(X) = 


a — X 


c 



h 

d - X 


= X^ — (u -f- d) X + ud — be. 


If in any polynomial /(X) = 2 substitute the matrix 

A for X and multiply the constant term cq by 7, we obtain a matrix 
denoted by/(A). In the special case (22), we get 
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which is readily verified to be equal to matrix 0. This result is 
the case n = 2 of the important 

Thbobbm 4. Any square matrix satisfies its characteristic 
equation. 

Let (21) be the characteristic determinant of A. We are to 
prove that <^(A) = 0. Since the elements of AL — XT are linear 
fimctions of X, and liie elements of its adjoint C are (n — l)-rowed 
determinants, they are polynomials in X of degree ^ — 1. If 

the element in the ith row and jth column of C is c <,-4 X*, then 

c=E C7i X* Cj = (c<,») a, i = 1,..., 

;;-0 


Applying (16) wili A replaced by A — XJ, we have 


(A - }J)C = <i>(\)L 

Hence 

AE X‘ - XE O'* X* = E a* 

X:-0 ib-O 


Equating the terms free of X and the coefficients of X, X^,..., 
Xn-i^ X", we get 

A.Co ^ clq 2 f 

ACi — Co ^ CLi Ij 

— Cl ^ dz Ij 


ilCn—1 2 ^—l2} y 

- = ttn J. 

Multiply these equations on the left by I, A, A”’ 

respectively and add; we get 

Q ^ Ool aiA + a^A^ + ... 4 . + an A^ ^ 0 (- 4 .). 
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26. Rank of a matrix. Every matrix M having more than one 
element contains other matrices obtained from M by deleting cer¬ 
tain rows or columns or both. In particular, it contains certain 
square matrices. The determinants of these square matrices are 
called the determinants of M. 

A matrix is said to be of rank r if it contains at least one r-rowed 
determinant which is not zero, while all its determinants of order 
higher than r are zero. The zero matrix all of whose elements are 
zero is said to be of rank 0. 

For example, the rank of the matrix 

a. t c d e 

p q r $ t 

a h c d e 

p q r s t 

is 2 or less since every three-rowed determinant has two rows alike and is zero. 
The rank is 2 if a, b, c, d, e are not proportional to p, q, r, s, t, so that a two- 
rowed determinant is not zero. The rank is 1 if they are proportional and not 
all zero. 

By the rank of a determinant is meant the rank of its matrix. 

Theorem 6, If A is any matrix with n columns and B is any 
matrix with n rows^ any irrowed determinant D of matrix AB is equal 
to a mm of terms each a 'product of a Inrowed determinant of A by a 
irrowed determinant of B. 

Let A = (aij), B = (6^,), P = AR = (p^) be given by (2), 
(5), (6). For the case n — 3, < = 2, the theorem is illustrated by 
the formula 


P 12 Viz 


Oil 612 ”f" a^iZ &22 "t" 0^13 &82 ®11 &18 ’4" ^12 623 "t" ^^18 &33 

p22 p23 


(hi 612 “h Ct22 &22 "h ^^23 bs2 (hi &1S H*" ^^22 &23 ”1“ 0^28 &33 


021 O22 


612 &13 
622 & 2 S 


+ 


Ou Ui3 
021 (hs 


612 613 

&32 &83 


+ 


O12 Oi3 

O 22 dss 


622 628 

632 &83 
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Z)« 


i 

Vhh. •••Pnfcj 


n 

2 o<i/i • • 

n 

. 1 

VitH ‘ ‘ ■ Ptffc* 


n 

n 





Jl-1 



where ii <{2 < ‘ * <Jot- Since each element 

of the first column is a sum of n terms, D is equal to the sum of n 
determinants of which the jith has as elements of the first column 
theiith terms of those sums and has as elements of the remaining 
columns the elements of D. In other words, we may remove the 
summation sign from the first column and place it in front of the 
symbol of the resulting determinant. The common factor of 
the elements of the first column may be taken out as a factor of 
the determinant. Treating the other columns similarly, we get 






hh • * • 


A = 


' * ^HU 


Unless ji,... ,ie are distinct, A = 0. Select . •., from 
1, •.., n so that gx<g 2 < • • • Kg^ li ii = . ,it = gt^ 

A is a t-rowed determinant a of A. Next, let ji,... be an 
arrangement of ^ 1 ,. • •, gt which is derived from ^ 1 ,.. -, by i 
successive interchanges of two terms. Hence A may be derived 
from a by J successive interchanges of two columns, so that 
A = (- 1)^ a. The sum of the n\ products (— 1)^ • ■ ■ 6/^*^ 
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corresponding to all such arrangements ji,..., j«is, by definition, 
the expansion of the determinant 





which is a determinant of B. Hence H where the sum¬ 
mation extends over all the selections from 1,... ,n 

such that gi<gi < • • • <gt. 

CoBOLLABT. The rank of the product of two matrices cannot 
exceed the rank of either factor. 

For, if all t-rowed determinants of A (or of B) are zero, the 
same is true of all t-rowed determinants of AB. 

Thbobem 6. If A is any matrix vnik m rows and n columns and 
B is any nonrsingular, vrTowed, square matrix, then A and AB ha/oe 
the same rank. If C is any non-singidar, mrrowed, square matrix, 
(hen A and CA have the same rank. 

For, if r is the rank of A and p is the rank of P = AB, the 
CoroUary gives p ^ r and, when applied to A = PB~\ gives 
r ^ p, whence r = p. Next, if r' is the rank of CA, the Corollary 
gives r' ^r and, when applied to A = C-^ • CA, gives r ^ r', 
whence r' = r. 

27. Bilinear forms. A polynomial in the m + n variables 
»ij..., yi, •.., yn is called a bilinear form if each of its terms 
is of the first degree in the x’b and also of the first degree in the 
y’B. An example with m = 1, n = 2, is 7xiyi — 5xi y^. The 
general bilinear form is 

m n 

^ ~ iE]) Vi* 

i-l 7-1 


(23) 
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We may write 

m n 

a li anyj {i = 1,, m). 

i-l j-l 

The matrix A = (an) of these m linear forms h is called the 
matrix of the bilinear form (23). The coefficient an of Xiyj in a 
is therefore the element in the ilh row and jth column of the 
matrix A of a. 

By Theorem 1, the linear transformation (7) with matrix B 
replaces U by ^kPihZk, where (pik) = AB. Hence that trans¬ 
formation replaces a by 


z 


n 

Z Va 

*-i 


whose matrix is AB. 

H we do not alter the y’s, but apply the transformation 

m 

C24:) Xi — 'y j Cik (i = , Wl) 

of matrix C to a in. (23), we get 

m It ITS n 

52 5^ Z/j ~ 52 f* Vh 

Jfc«l y-1 

where dhi = 2* Ci* an. Hence is the sum of the products of 
the elements of the Xith row of matrix 


C'- 


Cli • • • Cml 


* * • Cmm 


by the corresponding elements of the jth colmnn of A = (an), so 
that the new bilinear form has the matrix C'A. Matrix C' is 
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called the transpose of matrix C, being derived from C by employ¬ 
ing the successive rows of 0 as the successive columns of C'. 

Thboebm 7. If in a bilinear form with the matrix A we subject 
the x’s to a linear transformation (24) with the matrix C, and the y’s 
to a linear transformation with the matrix B, we obtain a new bilinear 
form with the matrix CAB, where C is the transpose of C. 

We may write the bilinear form (23) as follows: 

n m 

OL = ^ ^ Xj y jy 'Kj — ^ dij x% 0* “ j 

y-i i-i 

The matrix of the linear forms Xi,..., Xn is the transpose A' of 
matrix A, Transformation (24) with matrix C replaces Xi,.. -, X» 
by linear functions of ..., f^ the matrix of whose coefficients is 
A'C (Theorem 1). Hence this transformation replaces a by a 
bilinear form with the matrix (A 'C )By the result preceding the 
theorem, this matrix must be equal to C'A. Since the transpose of 
A' is A, this process proves 

Theoebm 8. The transpose of a product of matrices is equal to the 
product of their transposes taken in reverse order. 

Exebcises 

1. Prove Theorem 8 by direct multiplication. 

2. Prove that by means of the transpose of « I. 

3. If AA' = J, then AA » L 

4. AnmXn matrix (aij) is of rank 0 or 1 if and only if there exist m + n 

numbers ci,..., d:,..., (4 such that aij = c< d,- (i = 1,..., w; y = 1, 

..?i). 

28. Fields. Two forms (each linear, bilinear, or quadratic) 
may be equivalent under a transformation with complex coeffi¬ 
cients, but not equivalent under one with real coefficients. Again, 
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two forms may be equivalent under a transformation with real 
coefficients, but not under one with rational coefficients. Here, 
as elsewhere in modem algebra, it is necessary to specify carefully 
the nature of the constants employed. To do this briefly and 
clearly, as weU as to give full generality to our theorems, we make 
use of the concept “field” (or domain of rationality). 

A set of complex numbers o + is called a field if the sum, 
difference, product, and quotient (the divisor not being zero) of 
any two equal or distinct numbers of the set are themselves num¬ 
bers belonging to the set.^ 

The following are examples of sets of numbers each forming a 
field: an complex numbers; aU real numbers; all rational num¬ 
bers; all numbers of the form a + br with a and 6 rational num¬ 
bers and r = i or r = 3*. 

But the set of all integers is not a field; nor do all positive real 
(or rational) numbers form a field. 

29- Linear independence.^ The quantities (numbers or func¬ 
tions) h,... ,ln are called linearly dependent with respect to a 
field F if there exist numbers Ci,..., c», not all zero, of F such that 

(25) cik+ ■■■ + (^l„ = 0. 

If no such numbers Ci exist, Zi,..., are called linearly inder 
pendeni with respect to F. 

It is convenient in proofs by induction on n to employ these 
terms also in case n = 1. Thus Zi is linearly dependent or inde¬ 
pendent according as Zi = 0 or Zi 0, respectively. 

^ Further fields satisfy the definition by postulates in the author’s Algebras and 
Their Arithmetics, University of Chicago Press, Oh. XI. An example is the field of 
the residues of integers modulo p, a prime. For it, our theorems become important 
properties of congruences. 

^We shall not presuppose the theory of systems of linear equations with coeffi¬ 
cients in any field F, but deduce that theory in §§31, 32 from the present discus¬ 
sion. For this reason we avoid the shorter proof of Theorem 9 by means of that 
theory. 
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Theoebm 9. If yi,, Vn are linearly independent with resped 
to afield F, the n linear functions 

n 

(26) li=^aiiyj {i = I, , n), 

/=i 

with coefficients in F, are linearly independent or dependent mth 
respect to F according as the determinant A = | ai/1 is not zero or is 
zero. 

First let A 5 ^ 0 and assume, contrary to the theorem, that a 
relation (25) holds. Then 

n n n 

^CiU^'^dauyi-O, 2fflj#Ci = 0 (j= 

i***l *, j=»l 

Hence A Cf = 0, and every Ci = 0, contrary to h 3 rpothesis. 

Conversely, if Ii,..., are linearly independent, then A 5 ^ 0 . 
We shall give a proof by induction on n, noting that it is evidently 
true when » = 1. If = 0 , 1 • = 0 would be a relation (25). 

Hence after permuting the y’s, we may assume that 9 ^ 0. 
Since our converse will follow if proved for h,, Zn-i, OnvT^ in, we 
may assume that a„» = 1. Then 


n—1 

Zi ^ 1% “ a^filn = ““ ain ^i)yj = 1, • • • , ?2> 1) 

j-1 


a.re linearly independent linear functions of ?/i,..., 2/n-i, since 


n —1 1 

Zi = CiQf% J'ji) “ 0 

i-1 i-1 

mplies that each c»- = 0 in view of the linear independence of 
i,..., Zn. Hence by the hypothesis for the induction, the deter- 
ninant 


D = I an - Oin Onf I {i,j 1 ) 
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is not zero. If in A we add to the first, second,..., (n — l)th 
colnnms the products of the elements of the last column by 
— a„i, — a„ 2 ,..., — a»n-i, respectively, we get a determinant 
whose last row is 0, 0,..., 0,1 and having D as the cofactor of 1. 
Hence A = D 0. 

Theorem 10. J/ j/i, ..., yn are linearly independent with respect 
to F, and k,... ,1m. are any linear forms in yi,... ,yn 'onth coeffir 
dents in F, we can select certain of the Vs, say 

Vi ~ (i 1 ,.. ., f), 

which are linearly independervt with respect to F and are such that the 
m — r remaining Vs are expressible linearly in terms of vu - 7 Vr 

with coeffidents in F. 

As vi we may take any which is not zero identically in the 
y^s. If every Z,* is a product of by a number of F, the theorem 
holds with r = 1. In the contrary case, take as V 2 any hz which is 
not such a product, whence rji and 172 are linearly independent. If 
every Zy is a linear combination of vi and 772 with coeflBlcients in F, 
the theorem holds with r = 2 . In the contrary case, take as vb 
any Z*, which is not such a combination, etc. 

The value of r is given by Theorem 13. 

Theorem 11. If r < n in Theorem 10 , there exist linear forms 
Vr+i 7 • * •, Vninyi, ... ,yn with coefficients in F snch that rju * 7 Vn 
are linearly independent with respect to F. 

Krsti yi; - • • > 2 /n are not all linear functions of 771 ,.. •, Vr with 
coefficients in F. For, if so, we have in particular 

r 

(27) yi ^ ^ ^ tih Vk (^ ~ 1, •. •, r). 

hri 

Hence by Theorem 9, | tik \ ^ 0. We may therefore solve (27) 
and obtain 17 * as a linear function of ^ 1 ,, yt with coefficients 
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in F. By hypothesis, y,- (i > r) is a linear function of the ij’s and 
hence oi yi,...., t/r with coefficients in F. This contradicts the 
linear independence oi yi,, y-n with respect to F. 

Hence not all linear functions of yi,..., with coefficients in 
F are linear functions of 171 , — ,i)r with coefficients in F. Thus 
there exists a linear function yr+i of the y’s such that 171 ,.. •, »jr, 
17 , 4.1 are linearly independent with respect iaF. If r + 1 = », the 
theorem is proved. If r + 1 < n, we repeat the argument m the 
first part with r replaced by r + 1 and conclude that yi,... ,yn 
are not all linear fimotions of 171 ,, 17,41 with coefficients in F. 
Thus there exists a linear fimction 17,42 of the ^’s such that 171 ,..., 
17,43 are linearly independent with respect to F. This proves the 
theorem ifr + 2 = n. Ifr + 2<», we repeat the argument. 

30. Equivalence of two matrices or two bilinear forms. Two 
bilinear forms a and ai in aii, ..., Xm, yi, ..., Vn, with coefficients 
in F, are called equivalent inFif and only if a becomes ai when we 
subject the a:’s in a to a linear transformation with non-singular 
matrix C and the ^’s to a linear transformation with non-singular 
matrix B, the elements of C and B being in F. Then also the ma¬ 
trices A and Ai of a and ai are called equivalent in F. By Theorem 
7, Ai — O'AB. In other words, two mXn matrices A and Ai 
with elements in i?" are called equivalent in F H and only if there 
exist two non-singular square matrices'- H and B with elements 
in F and having m and n rows, respectively, such that DAB = Ai. 

Theorem 12 . Two mairices (or bilinear forms) in F are equivor 
lent in F if and only if they fuwe Hie same rank. 

If the matrices (or bilinear forms) are equivalent, they have the 
same rank by Theorem 6 . 

Conversely, if a and at are any bilinear forms in F with the same 
rank, they are equivalent in F. This is evident if the rank is zero, 
whence a s 0 , ai = 0 . Consider therefore a bilinear form a, 

^ We have written D for the earlier C'. Conversely, given D, we get C = D\ 
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a Xi liy li — cLij y j •, •, 

j-i' 

in which the are numbers of F not all zero. By Theorem 10, we 
may select certain of the Z^s, say rn = (i “ 1, ..., r), which 
are linearly independent with respect to F and such that the m ^ r 
remaining Ps are linear functions of 9?i,..., i/r with coeflBlcients in 
jP. Since h,,., ,kr are distinct, we may select m r further 
k*s so that fei,..., form a permutation of 1,..., w. Also for 
i > r, denote hi by Introduce the new variables 

Xi ^ Xjci (z = 1, • « • , ?72r). 

Then 

m m r 

a= ^Xtilk{ = ^Xi'rn, 7ii=^dij7ij (i = r + 1,... ,m), 

*-i i-i j-i 

where the di# belong to F. Hence 

r m r r 

Xj ijj ”1“ Xi dij ijy = yfjf 

j-l i-r+l j'-l ,*-l 

if we write 


Sj = +2 (i = !>•••# r). 

t-r+l 


The latter and ~ r + 1 ,..., m) give m linearly inde¬ 

pendent linear functions gi,..., of Zi,..., with coeffi¬ 
cients in F. Hence the f s are linearly independent linear func¬ 
tions of rri,..., with coefficients in F. The determinant of the 
matrix M of the coefficients of xi,..., Xm in the f^s is not zero by 
Theorem 9. If r < w, there exist, by Theorem 11 , linear functioi^ 
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f r+i,.. •, fn of yi,..., with coefficients in F such that 171 ,, 
Vr, fr+ij..., fn are linearly independent with respect to F. If 
r = Thf-we suppress the f i and employ Vn- In either case, 

the determinant of the matrix N of the coefficients of yi,..., y» 
in these n linear functions is not zero by Theorem 9. 

We have now proved that a can be reduced to the canonical 
form 

(28) 

j-i 

by a linear transformation expressing the a:’s as linear functions 
of the I’s with the matrix M-^ and a linear transformation express¬ 
ing the y’s as linear functions of the ij’s and f’s with the matrix 
N~\ The matrix of (28) is evidently of rank r. By the first part 
of our theorem, also a is of rank r. 

Since a and ai are both equivalent to (28), they are equivalent 
to each other. 

We have also proved the first part of 

Theobbm 13. The r of Theorem 10 is the rank of the matrix of the 
coefficients of h.,, Im. Moreover, the rank of the matrix T of the 
coefficients of ni,..., Hr is r. 

Let p be the rank of T. Apply the first part of the theorem to 
571,..., ijr in place of li,..., Im. Hence if r > p, r — p of the 17 ’s 
are expressible linearly in terms of p of them with coefficients in 
F, contrary to the linear independence of 571 ,..., iir with respect 
toF. 

CoBOLLAET 1 . If ?» > n, any m linear forms in n independent 
variables with coefficients in JP are linearly dependent with respect 
ixtF. 

For, the rank r is evidently ^ n and hence < m. Then by 
Theorem 10, m — r of the forms are expressible in terms of r of 
them, so that the m forms are linearly dependent. 
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Note that, conversely, if the m forms are linearly dependent, 
then the rank r is less than m. Hence we have 

CoROiiLABY 2. Any m linear forms in n independent variables 
with coefficients in i?* are linearly dependent with respect to if 
and only if every m-rowed determinant of the matrix of the 
coefficients is zero. 


Exercises 

1. Find linear transformations which reduce 

2 x 1 y 2 3xi 2/3 4“ 4x2 yi — 2 x 2 2/3 ^ 3x8 2/1 + 2/a to ^ m + & wa- 

2. 6x + 32/ — 6, 8x + 15«, 6x + 2y -|- 52 — 4 are linearly dependent. 

3. When F is the field of all real numbers, restate Theorem 12 without 
using explicitly the notion of .a field. 

4. Any matrix of rank r is equivalent to one whose element in the ith row 
and ‘ith column is 1 when while all further elements are zero. 

5. Any matrix of rank r can be expressed as the sum of r matrices of rank 
unity. Hence any bilinear form of rank r can be written as the sum of r 
products each of a linear form in Xi,..., x„i by a linear form in 2 / 1 ,, 2/n* 

6. When a bilinear form is the sum of r products each of a linear form in 
Xi,..., Xm by a linear form in 2 / 1 ,..., 2/n, the matrix of the form is of rank r 
if and only if the r linear forms in the x^s are linearly independent and the r 
linear forms in the 2 /'s are linearly independent. 

7. If r is the rank of an ?^-rowed square matrix and if 2E is the rank of the 
matrix composed of the first t rows of the former, then iE S r + < — TZr. 

8. If ri and n are the ranks of two n-rowed square matrices, and if R is the 
rank of their product, show that i2 ^ ri 4- ra — ti. Hint: First prove this 
when the left factor is of the form in Ex. 4. 

31. Homogeneous linear equations. Consider a system of 
equations = 0 ,..., = 0 in the unknowns 2 /i> • • • ? 2 /n with 

coefficients in a field F and of rank r. After rearranging the equa¬ 
tions, we may assume by Theorem 13 that Zi,..., Zr are Knearly 
independent and have a matrix M of rank r, whfie Zr+i,..., Z« 
are linear combinations of Zi,. •., Zr with coefficients in F. Some 
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r^rowed determinant of ilf is not zero. After relabeling the y’s, we 
may therefore assume that the determinant D of the coefficients of 
Vi,... ,yT iu Zi = 0,..., Zr = 0 is not zero. Transpose to the 
second members the terms involving yi(j > r) and apply the 
ordinary method of solution by determinants. We obtain 

n 

(29) yii “ ^iiVi • • • > 

7«r+l 

where the c.-/ are in F. When these values of yi,, Vr are 
inserted in Zi,..., Zr, we therefore obtain expressions in 2/r+i,..., 
yn which are zero for all values of the latter. The same follows for 
Ir+i,... flm since they are linear combinations of Zi,. •., Zr. 

Theorem 14. Given m homogeneous linear equations in n unr 
knowns whose coefficients belong to any field F and have a matrix of 
rank r, we may select r of the equations so that their matrix has a 
novrDanishing r-rowed determinant. These r equations determine 
uniquely r of the unknowns as homogeneous linear functions, with 
coeffi/dents in F, of the remaining n — r unknowns. For all values of 
the latter, the expressions for the r unknowns satisfy the given m 
equations. 

Corollary. There exist solutions not all zero if and only if 
r <n and certainly Urn <n. In particular, n homogeneous linear 
equations in n unknowns have solutions not all zero if and only if 
the determinant of the coefficients is zero. 

If the homogeneous linear equations Zi = 0,..., Zm = 0 have 
the two solutions (wi,..., Un) and {vx,..., t;„), they evidently 
have the solution {gux + hvi ,..., gUn + hvn), in which g and 
h are any numbers. We call the two solutions linearly dependent 
if there exist constants g and h, not both zero, such that ffUi + hvi 
=0 (i = 1,..., n); but linearly independent if no such constants 
g and h exist. Analogous definitions apply to more than two 
solutions. 
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For example, the single equation 2yi + ^Vi — 5^3 = 0 has the 
linearly independent solutions 

(wi, Ui, Ms) = (- 3/2,1,0), (»i, 1)2, Vi) = (5/2, 0,1), 

while every solution (j/i, yi, yi) is linearly dependent on them: 

yi = Ml 4* yi Vi, Vi — yiV/i yiVi, ys = ^2 ms + yi vs. 

This example illustrates the following 

Thboebm 15. Any system of m homogeneous linear equations in 
2/i> • • ■ > J/n of rank r < n has n — r linearly independent solutions, 
while every solution is linearly dependent on them. 

With the notations (29), we may take as the n — r solutions 

(30) (cij-, ... ,Cri, 0,..., 0, 1, 0,..., 0) (j = r+1, ... ,n), 

where 1 is in the jth place. Multiply the numbers in (30) by y,-, 
sum as to j, and apply (29); we get the solution 


y i Oli yx, . . . , ^^y / Cr J yr, , yn), 

in which the summations extend from j = r + 1 to j = n: Since 
these numbers are aU zero if and only ^ ..., are all zero, 

the solutions (30) are linearly independent. 

32. Non-homogeneous linear equations. Consider a system of 
m equations Zi = ftij= km, where the k’s are constants and 
h,... ,lm are linear forms in yi,..., y„ whose matrix A is of rank 
r. After rearranging the equations, we may assume that fi,..., Z, 
are linearly independent, while 

r 

~ ^3 (i r 1, -., ^ m). 

j-i 


(31) 
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Hence our m equations It = hi are inconsistent unless 

r 

(32) ki = '2^Ciiki (i — r+ 1,m). 

j-i 

Write \i = li — ki y«+i, where yn+i — 1 . Since li,... ,lr are 
linearly independent, the same is true of "Ki, ..., Xr. Next, 

r 

~ ^ ss Y* •"{- j 9?2») 

7-1 

if and only if conditions (32) hold, and then the matrix of Xi,..., 
\m is also of rank r by Theorem 13. 

TheoremT6. Let r be the rank of the matrix A of the homogeneous 
linear functions h,... ,lm of yi,..., Vn* Let B be the augmented 
matrix derived from A by annexing a column halving the elements 
fci,..., km, so that the rank p of B is ^ r. The equations li = fci, 
..., Im = km are inconsistent if p > r. If p ^ r, certain r of the 
equations determine uniquely r of the unknowns as linear functions 
of the remaining n r unknowns; for all values of the latter^ the 
expressions for these r unknowns satisfy also the rermining m-- r 
equaticms. 




Chapter IV 


QXJADBATIC AND HERMITIAN FORMS, SYMMETRIC 
AND HERMITIAN BILINEAR FORMS 

Our discussion of these four types of forms requires but little 
more space and effort than are necessary for a treatment of 
quadratic forms alone. There has been a steady increase in the 
importance of the forms named after Hermite. Our results on 
quadratic forins suffice both for their metric and their projective 
classification. 

33 . Synunetiic bilinear forms and quadratic forms. The form 

n 

(1) ~ Clif 

if 7-1 

is called a synmetric bilinear form if it remains unaltered by the 
interchange of Xi and yt for i = 1 ,..., w, and hence if 

(2) (iji = dii = 1,.. - , 7i), 

The matrix A = (Ui,) of a then remains unaltered by the inter¬ 
change of rows and corresponding columns, and is called a sym¬ 
metric matrix. In other words, A is symmetric if and only if it is 
equal to its transpose A'. 

By Theorem 7 of Ch. Ill, the transformation 

n 

(3) xt = '^h{fXj (i = l, ...,n), 

i-i. 

with the matrix B = and the cogredient transformation 

(4) (i = l,...,n), 

7-1 
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with the same matrix B, replace (1) by a bilinear form «i, whose 
matrix is Ai == B'AB, If A' = A, then A'l = B'A'B = At by 
Theorem 8 of Ch. Ill, whence also matrix At is symmetric. 

Any two ? 2 rrowed square matrices A and At with elements in a 
field F are called congruent in F ii there exists a non-singular Hr 
rowed square matrix B with elements in F such that At = B'AB, 
By Theorem 6 of Ch. Ill, congruent matrices have the same rank. 
Recalling the definition of equivalence in §30, we have 

Thboeem 1. Two symmetric bilinear forms with coefficients in a 
field F are equivalent under norirsingular cogredient transformations 
(3) and (4) with coefficients in F if and only if their matrices are 
congruent in F. 

If we identify yi with Xi for i = 1,..., n, a symmetric bilinear 
form ( 1 ) becomes a quadratic form 

n 

(5) g ^ 

if ?-l 

having the same symmetric matrix A — A\ For w = 2 , 
q = Uii Xt? + 2^12 3/1 iC 2 + Oqz 

The rank and determinant of A are called the rank and determi¬ 
nant of q. 

If also in (4) we identify y^ with Xif and 7,- and Z,*, so that (4) 
becomes (3), we conclude that transformation (3) with the matrix 
B replaces a quadratic form (5) with the symmetric matrix A by a 
quadratic form with the symmetric matrix R'AB. 

Hence we have 

Theoebm 2. Two quadratic forms with coefficients in a field F 
are equivalent under norirsingular transformation with coefficients in 
F if and only if their matrices are congruent in F. 

A comparison of Theorems 1 and 2 yields 
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CoBOLLAET 1. Two Symmetric bilinear forms with matrices A 
and Ai are equivalent under cogredient transformations if and 
only if the quadratic forms with matrices A and Ai are equivalent, 
where the forms and equivalence relate to the same field F. 

Hence these two problems of equivalence are not essentially 
different and each may be treated by a study of the congruence of 
symmetric matrices. 

34. Heimitiaa forms and Hermitian bilinear fonns. Let S be 
any field containing a number v which is not the square of any 
number of S. Then all the numbers x — a-\- bvi, in which a 
and b belong to S, form a larger field F. Call x = a — bi^ the 
conjugate of x. 

An important example is that in which S is the field of all real 
numbers, and j> = — 1. Then F is the field of all complex numbers, 
and X and x are conjugate imaginaries. 

The bilinear form, with coefficients in F, 

n 

( 6 ) 2 Vh 

i, 3^1 


is called aa Hermitian bilinear form in F, 

If we replace each element of a matrix A by its conjugate^ we 
obtain a matrix denoted by A and called the conjugate of A, Hence 
the matrix of (6) has the property 

(7) A' = I, 

and is called an Hermitian matrix. It is seen at once that 

(8) M = ikr, MN^MN, 

By Theorem 7 of Ch. Ill, transformation (3) with the matrix 
B and the conjugate transformation 
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(9) 2/i = E bo- y/ (i = 1, •. • , n), 

J-1 

with the conjugate matrix B, replace (6) by a form whose matrix 
is Ai = B'AB, Then (7) implies 

Ai' = B'A'B « Ii. 

Theorem 3. A linear transformation with matrix B and the conr 
jugate transformation replace an Hermitian bilinear form with the 
Hermitian matrix A by an Hermitian bilinear form with the 
Hermitian matrix B'AB, 

Any two Tir-rowed square matrices A and Ai with elements in a 
field F shall be called conjunctive in if there exists a non-singular 
7^-rowed square matrix B with elements in F such that 
Ai = B'AB. By Theorem 6 of Ch. Ill, conjunctive matrices 
have the same rank. Theorem 3 implies 

Theorem 4. Two Hermitian bilinear forms with coefficients in 
the field F are equivalent under a novrsingular transformation and its 
conjugate with coeffiicients in F if and only if their matrices are conr 
junctive in F. 

If in (6) we identify yj with 5,-, we get the Hermitian form 

n 

( 10 ) 

i, 

This form is equal to its conjugate. If we also identify F,- with 
Z/, we see that (9) become equations which follow from (3) by 
taldng conjugates. Hence Theorem 3 implies 

Theorem 5. A linear transformation with the matrix B replaces 
an Hermitian form with the Hermitian matrix A by an Hermitian 
form with the Hermitian matrix B'AB. 

This implies the following analog of Theorem 4: 
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Thbobem 6. Two Hermitian forms with coefficients in the field F 
are equivalent under a nortrsingvlar transformation with coeffitdents 
in F if and only if their matrices are conjunctive in F. 

A comparison of Theorems 4 and 6 yields 

CoEOLLABT 2. Two Hermitian bilinear forms with matrices A 
and Ai are equivalent under a non-singular transformation and its 
conjugate if and only if the Hermitian forms with matrices A and 
Ai are equivalent, where the forms and equivalence relate to the 
same field F. 

Hence these two problems of equivalence are not essentially 
different. From this and the corresponding result in Corollary 1, 
we see that the conclusions in our further study of two quadratic 
or two Hermitian forms imply corresponding conclusions concern¬ 
ing two symmetric or two Hermitian bilinear forms. 

Not only have the latter two problems been reduced to the 
former two, but the former two may be treated together by the 
simple device next explained. 

35. Rational reduction of quadratic and Hermitian forms. To 
each element a of the field F we make correspond an element a of 
F such that 

(11) a = a, ag = ag, a± g = a±g. 

Hence to a matrix A with elements in F corresponds a matrix 
A whose element in the ith row andjth column is o<y. Hence, as in 
(8), we have 

(12) A=A, Tq = AQ, (I)' = A'. 

If F is obtained as in §34 from a subfield S by adjoining v^, and 
if we take o to be a for every o in F, we see that relations (11) 
hold. They evidently hold also if we take a to be a itself. These 
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two cases axe the ones needed in Hennitian and quadratic forms 
respectively, as we shall next explain in detail. 

Consider the form, with coefficients in F, 

fl 

(13) Otij x% x^f 
i, i~l 

whose matrix A therefore has the property 

(14) A' = 1. 

If S = S for every a in F, and if = 5,-, (13) becomes the Henni¬ 
tian form (10) whose matrix A is Hennitian and hence has the 
property A' — A. But if S = o for every ainF, and HSj = ®,-, 
(13) becomes the quadratic form (6) whose matrix A is S3unmetric 
and hence has the property A' = A. Hence the form (13) fur¬ 
nishes a generalization of Hennitian and quadratic forms, which 

will be seen to effect a unification of the two theories of those forms. 

\ 

Theorem 7. By a iioi^singidar transformation^ with coefficients 
in afield F, any form (13) of rank r with coeffiicients in F can he re- 
duced to one of type 

(15) *^1 2?! “i" * ' ■ {cOCh ff i 9^ Of Qi ^ ^t)* 

We shall first' transform (13) into a like form lXi^i+ ■ • • 
having I 9 ^ 0, If an — 0 and a** 9 ^ 0, the transformation 

xi = Xkf Xh = Xi, Xi = Xi (i 9 ^ If h) 

replaces (13) by a** Xi + • - • • If every a« = 0 and a^ 9 ^ 0, 
the transformation 

Xi = Xi, X 2 = gXi + X 2 , Xi = Xi (i > 2) 

^ 1 Transformation (3), of matrix B, together with the^ induced transformation 
Xi ==: 2/%j The matrix of the resulting form is B'AB and has the same rank 
as by Theorem 6 of Ch. BI. 
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replaces (13) by IXiXi , where 1 = + <hig. Taking 

g — 1 / 021 , we have Z = 2 . 

It remains to consider a form a of type (13) having ou 0 . 
Then 

1 « 
oj = — (au iCi + X/ (^11 ^ 1+2 ^ 1 / 

^11 i -2 /-2 

+ ^ (a; 2 ,..., ii?n, ?2 ,..., 5»). 

Denoting the quantity in the first parenthesis by an Xi, we see 
that that in the second parenthesis is equal to an Xi, and hence 
that the transformation 

I 

1 ” 

= Xi-2 > 1) 

^11 t »2 

replaces a by 

an Xi Xi + 0 (X^ 2 ,..., Xn, X 2 ,.,., Xn). 

Since the matrices of a and On Xi have the property (14), the 
matrix of 0 has that property. We may therefore proceed with 
as we did with (13). Ultimately we obtain (15). The number of 
its terms is evidently its rank and hence is the rank of (13) by the 
preceding footnote. 

CoROLLAjRY 3. Within any field any quadratic form of rank r 
is equivalent to 

(16) + • • • + ffr iCr® (each gi 9 ^ 0 ). 

36. Canonical forms. When F is the field of all complex num¬ 
bers, (16) is reduced to ^ by the transformation 

Xi = gr^ (i = 1 ,..., r). 

Theobem 8. Within the field of all complex numbers, every guadr 
raticform of rank t can be reduced to Ji® + • • • + 62 / ^ noifir 
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singtdar linear transfcrmaiion. Hence two quadratic forms are 
equivalent if and only if they are of the same rank. 

From this theorem and Corollary 1 we have 

CoboliiABy4. Two symmetric bilinear forms are equivalent 
under non-singular cogredient transformations with complex 
coefficients if and only if they have the same rank r. Each can be 
reduced to |i i?i + • • • 4* Ir i?r* 

Theorem 9. Let Fbeffie field of all reed numbers or the field of aU 
compUz numbers. In the respective cases, take a = a or a for every 
a in F. By a non-singvlar transformation with coefikients in F, any 
farm (13) of rank r with coefikients in F can be reduced to. 

( 17 ) yiyx+ - V yp% - - - Vryr, 

where the index p is uniquely determined. Hence two forms (13) are 
equivalent in F if and only if they have the same rank r and the same 
index p. 

Since in (15), each gi is real. After applying to (16) the 

transformation which permutes certain of the x’s, we may assume 
that gi,... ,gp axe positive and gi(j > p) is negative. Write 
g,- = — dj(j > p). Then the transformation 

Xi == gr^Vi {i = 1,.. .,2?), 

Xj = dj~^ y j (y = 2? “h 1,..,, 7*), 

with real coefficients, replaces (16) by (17). Hence any form 
(13) of rank r with coefficients in F can be reduced by a non¬ 
singular transformation with coefficients in -F to one of type (17). 
Suppose that it can be reduced also to 

(18) 01 + * • • + 0g 2 3 — 0g+l Sg+i — • ■ ■ Zr 2r- 

We shall prove that p = S- Assume that p> By hypothesis, 
(17) and (18) are each equal to the given form (13) when the p’s 
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and z’s are certain linear functions of iCi,..., Xn with coefficients 
in F, Hence (17) and (18) are equal for all values of the x's. 

The system of q + n — p<n homogeneous linear equations 

(19) = Oj . . . ; Zg =0, yp+l ~ 0; . . • j 2 /n “ ^ 

in the n unknowns oji,.,., a;„ has solutions xi - Xi,Xn = Xn 
in F, not all zero, by the Corollary to Theorem 14 of Ch. III. Let 
Yi and Zi be the corresponding values of yi and Zi respectively. 
Equating the resulting values of (17) and (18), we get 

?1 + ••• + =“• < 2 ^ 2+1 ‘ ^ Zr^v 

The terms on the left are zero or positive, while those on the right 
are zero or negative. Hence every term is zero. In particular, 
Yif,.. fYp are zero. In view also of (19), we see that the system 
of equations = 0 ,..., == 0 has the solutions Xi,. .., Xn, 

not all zero. Hence the determinant of the coefficient of the re’s in 
2 / 1 , • • • j 2 /n is zero by the Corollary cited. In other words, the 
transformation which reduced (13) to (17) is singular, contrary 
to hypothesis. 

Separating the two cases of Theorem 9, we have 

CoEOLLARY 5. Any real quadratic form of rank r can be re¬ 
duced by a non-singular real transformation to 

(20) j/i® +- \-yp^- yp\i - - yr^ 

where the index p is uniquely^ determined. Hence two real quad¬ 
ratic forms are equivalent under non-singular real transformation 
if and only if they have the same rank and the same index. 

For r = 2 , the types are yi^ -f 2 / 2 ^ 2 /i^ ~ 2 / 2 ^ - 2 /i^ - 2 / 2 ®. 

^Discovered independently by Jacobi, Jour, fiir Matb., 63, 1857, 276, 281; 
Werke, 4,1884,693, and Sylvester (who called the uniqueness of ^ the law of inertia 
of quadratic forms), Phil. Mag., (4), 4, 1852,140; Phil. Trans., London, 143, 1853, 
481; Jour, fiir Math., 100, 1887, 465; CoU, Math. Papers, I. 1904, 376-7, 611; IV, 
523. 
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Corollary 6. Within the field of complex numbers, any 
Hermitian form of rank r can be reduced to 

(21) yiyi+ - Vypyp- yp+\ Vp+i- - Vr Vr, 

where the index p is uniquely determined. Hence two Hermitian 
forms are equivalent if and only if they have the same rank r and 
the same index p. 

The rank r of a quadratic or Hermitian form / is the maximum 
order of a non-vanishing determinant of the matrix of / and hence 
may be found from / without resort to the canonical form. A 
corresponding direct method of finding the index p is given in 
Theorem 22. 

Corollaries 1, 2, 5, 6 imply 

Corollary 7. There is a unique determination of p in the 
canonical form 

Xiyi+ - hXpVp- Xp+i 2 / 33+1 -- XrVr 

to which a real symmetric bilinear form can be reduced by non- 
singular real cogredient transformations, or to which a Hermitian 
bilinear form with complex coefficients can be reduced by a non¬ 
singular transformation on the a;^s and the conjugate transforma¬ 
tion on the 2 /^s. 

A real quadratic form is called positive if its index and rank are 
equal, and negative if its index is zero. Both are called definite 
forms. A positive form of rank r is equivalent under real trans¬ 
formation to 2 / 1 ^ -h • • • + 2 /r ^7 while a negative one is equivalent 
to - 2 / 1 ^ - .. ■ - 2/r^« 

Positive and negative Hermitian forms are defined similarly. 
They are equivalent to (21) with p — r and p = 0, respec¬ 
tively. 
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37« Reduction of a real quadratic form by real orthogonal 
transformation. We shall prove simultaneously the two compan¬ 
ion theorems: 

Theorem 10. If f and h are real n-ary quadratic forms and if h 
is positive and of rank the pair /, h can be reduced by a real non- 
singular linear transformation to 

/ = Xi f + • • • + Xn A = + ... -f- 

where Xi,..., X„ are the roots o/1 / — XA | = 0 and are all real. 

Theorem 11. If f and h are n-ary Hermitian forms and if h is 
positive and of rank n, the pair /, h can be reduced by a non-singular 
linear transformation to 

/ = Xl Ii + • • ' + X„ In In, A = |l |l + • * • + In fw 

where Xi,..., Xn are the roots of \f — XX | = 0 and are all real. 

Let A “ iflii) and B = ( 6 * 3 ) be the matrices of / and h. Let 
Xi be any root of | A — XB | = 0. Then the system of n linear 
equations 

tt n 

53 o.ii “ Xi53 (i = 1 ,..., n) 

/-I /-I 

become after transposition of terms a system of ri homogeneous 
linear equations in the Xj whose determinant is | A — Xi B | = 0, 
and hence have solutions .. ^Xn. not all zero. Then 

/ — ^ciiiXiXj =53^* ^y^bijXj == XiA, h ^^haXiXj. 

t. / i 3 », 7 

First, let / and h be Hermitian. Then A 5 *^ 0 by the assump¬ 
tion that h is positive and of rank n. Since f and h are re^, 
also Xi is real. 
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Next, let A be a real positive quadratic form. Consider the 
Hennitian form H = S having the same coeflScients as 

h. Write a:, = |, + iv», where J and ni are real. Then H is the 
sum which by hypothesis is positive unless the J’s 

are all zero, and 2 which is positive unless the v’s are 

all zero. Hence H is positive unless the x’s are aH zero. Thus 
Xi is real by the first case. 

Thus / — Xi A is a real quadratic form or a Hennitian form in 
the respective cases. Since its determinant is zero, it can be 
reduced by Theorem 7 to a real quadratic or Hennitian form 
‘ iVn) lacking one variable yi by a nonHsingular linear 
transformation which is real in the quadratic case. Let the same 
teansformation replace h by H. Hence it replaces 

f — XA= / — XiA + (Xi — X)A 
by 

(22) / - XA = F(y„ ..., y„) + (Xi - X)H(yi,..., y«). 

In the positive form H of rank n, the coefficient of yi yi is not 
zero, since otherwise H would vanish when yi = 1, y* = 0 (i > 1). 

Hsnce by the last part of the proof of Theorem 7, there exists .a 
non-singular linear transformation which leaves each yi(k > 1) 
unaltered and replaces H by a form cigigi + ^(yi,... ,yn), 
where Ci is positive. This transformation, which is real in the 
quadratic case, replaces the second member of (22) by 

yrd + (^1 - ^)Cl h + (^1 - • • • > Vn)- 

Write ^ for i?" -f Xi vJ-. Hence 

/ — XA s ^(yj,..,, y„) -j- (Xi — X)ci gi — X^(y 2 ,..., y„). 
Equating the terms free of X and those containing X, we have 
/ = h.Cigi^i + 0 (y 2 ,... ,y»), A S Ciyiyi + rPiVi, • • •, y»). 
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By the hypothesis on A, evidently i/' is positive and of rank 

— 1. Hence the discussion made for/, h, and Xi is applicable to 
and X2 and leads to the segregation of terms X2 C2 g 2 and 
C 2 gi 5^2. Repetitions of this argument show that / and h are re¬ 
ducible, by a non-singular transformation which is real in the 
quadratic case, to forms 

/ = Z) ffi ffij h=J^Cigi Si, 
in which the Ci are all positive. The transformation 

= Ci^ gi (z == 1,..., w) 

is real and non-singular, and yields the canonical forms given in 
Theorems 10, 11. 

As a special case of Theorem 10, we may take htoh^x^ + • • • 
+ Then the linear transformation leaves h unaltered and is 
called orthogonaL Similarly, if in Theorem 11 we take h to be 
XiXi+.,. + Xn Xn, the transformation leaves h unaltered and is 
called unitary. Theorems 10 and 11 therefore imply (and are really 
no more general than) the following two: 

Theorem 12. // / is a real guadroMc form of matrix A, the roots 
Xi,..., Xft 0/ its characteristic equation | A — XZ ( == 0 are all realf 
and f can be reduced to 2 ^ orthogonal transformation. 

Theorem 13. If f is any Hermitian form of matrix A, the roots 
Xi,. .., Xn of its characteristic equation | A — X7 | = 0 are aZZ real, 
and f can be reduced to 2 unitary transformation. 


Exercises 

1. A linear transformation with matrix B is orthogonal if and only B' B 
= I and is unitary if and only if R' S = 7. Hint: Take A = Ai = 7 in 
§§33, 34. 

2, The determinant of an orthogonal transformation is ± 1; that of a uni¬ 
tary transformation has the absolute value 1. 
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3. The product of two orthogoual transformations and the inverse of one 
are orthogonal. Likewise for unitary. 

4. If a linear transformation has two of the properties (i) is real, (ii) is 
orthogonal, (iii) is unitary, then it has the third property. 

5. Every real orthogonal transformation on x and y is of the form 

x' =s a; cos a — 2 / sin a, = .± (a; sin of + y cos a). 

6. The matrix of every unitary transformation of determinant unity on 
two variables is of the form 

C D- 

7. The transformation from one system of rectangular coordinates to 

another having the same origin is orthogonal. Then by means of Theorem 12 
show that the equation of any real quadric surface can be reduced to a form 
in which the terms of the second degree are ax^ + ± ax^ ± bj/S or 

arc*, where a, &, c are positive. 

8. In the canonical form (20), there are p positive and iV r — p negative 
coefficients. The number p — iV is called the signature. Two real quadratic 
forms are equivalent under real transformation if and only if they have the 
same rank and same signature. 

38. Rank of a syimnetric or Hemiitian matrix. 

Theoeem 14. If a certain r-rowed determinant d of a matrix is 
not zero and if all (r + lyrowed determinants having d as a first 
minor are zero, then the matrix is of rank r. 

Note that this theorem shortens the work of finding the rank r, 
since we need not prove that all (r + l)-rowed determinants are 
zero, but merely those having d as a first minor. - 

After interchanges of rows or columns or both, we obtain a 
matrix 




78 


QUADEATIC AND HEEMITIAN FORMS [Ch. IV 


in which 

^11 ' • * Oilr 

A - . 9^0. 

CLtI • • • Clrr 

The theorem will evidently foUow if proved for matrix A and 
determinant A. Write 

1% = diX yi "f* ' * * ‘4” ^tn Vn} 

(i = 1,.,,, m), 

Xi == 2/1 + ■ ■ • + Qfir Vn 

For s > r^t > Tj the determinant of order r + 1 of the forms 

X» + aityt (i = 1, s) 

is zero by hypothesis. Hence these r + 1 linear forms are linearly 
dependent by Theorem 9 of Ch. Ill, so that 

r 

(23) Xa + ^ Ci(Xi + a»-< 2/<)» 

i-l 

Since A 5*^ 0, we may solve equations X^ = a* 2/1 + * * • and 
express yi,..., 2 /r linearly in terms of Xi,..,, Xr. Substitution 
in Xa gives 

Xa — 61 Xi + • • • + Xr. 

Inserting this into (23) and noting that Xi,..., Xr, are linearly 
independent since 2 / 1 ,..., 2/r, Vt are, we see that 

r 

Oi hi ("f — 1,.. •, r), ttgt = hi dit* 

♦-1 

1 $ ^ Xa +X) ^ £ &»' (X< +2 2/0 “S &»■ 

i-r+l i-mi ««r+l <-l 


Then 
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Since Zr+i, * •. ,lm are therefore linear functions of Zi,_, h, 

which are linearly independent, matrix A of the Z's is of rank r by 
Theorem 13 of Ch. III. 

We shall next prove theorems which enable us to find the rank 
of a matrix A having the property A' ^ A hy slu examination of 
its principal minors only. A principal minor of any square matrix 
A is the determinant of the matrix obtained from A by deleting the 
same rows as columns; in other words, its diagonal elements all lie 
m the diagonal of A, 

Theoeem 15. The rank of a symmetric or Hermitian matrix A 
is r if an r-rowed principal minor pr is not zero and if zero is the value 
of every principal minor obtained by annexing to pr one row and the 
same column of A, and also of every principal minor obtained by 
annexing two rows and the same two columns. 

Without loss of generality we may assume that the elements of 
Pr lie in the first r rows and first r colunms of A == {aif). Let ga 
denote the determinant obtained from pr by annexing the ith. 
row and ^th column of A. Our theorem will foUow from Theorem 
14 if we prove that gij = 0 for all distinct values of i and j. Let 
c denote the determinant obtained from pr by annexing the ith 
and jth rows and the ith and jth columns of A. Let A»denote the 
cofactor of in c. By a standard theorem on determinants, 


(24) 


CPr 


A a Aif 

Aj’i Ajj 


— Qii Qii Qii Q 


By hypothesis, gu^ gr,-,-, and c are zero. Since A' = A, if we inter¬ 
change the rows and columns of ga, we get gij. Hence 


0 = gaga = gugih ga = 0 . 

Theoeem 16. If all (r + lyrowed principal minors and all 
(r + 2yrowed principal minors of a symmetric or Hermitian mairix 
A are zero, the rank of A is r or less. 
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This is true when r = 0, since an = a/,* = 0 and an a,,' — at,- 
a,-t = 0 imply that a^,- Si,- = 0 and hence that every element aij 
is zero, whence A is of rank zero. 

Proceeding by induction on r, we assume that the theorem is 
true when r == ft, that is, if aU (ft + l)-rowed principal minors and 
all (ft + 2)-rowed principal minors are zero, the rank of A. is ^ ft. 
To prove the theorem when r - ft + 1, let all (ft ■+■ 2)-rowed 
principal minors and all (ft + 3)-rowed principal minors be zero. 
Then if all (ft + l)“rowed principal minors are zero, the rank is 
g ft by the assumption just made. But if some (ft + l)-rowed 
principal minor is not zero, the rank is ft ■+• 1 by Theorem 15. In 
either case, the rank is ^ ft + 1; and the induction is complete. 

Theoeem 17. Any symmetric or Hermitian matrix of rank r 
(r > 0) contains an r-^rowed yrinciyal minor which is not z&ro} 

By the definition of rank r, all (r + l)-rowed principal minoi's 
are zero. If all r-rowed principal minors were zero, the rank would 
be g r — 1 by Theorem 16. 


Exeecises 

1. By means of Theorem 16 prove that 2 is the rank of 


-6 

- 5 

1 

- 2 ' 

- 5 

56 

4 

11 

1 

4 

0 

1 

-2 

11 

1 

2 j 


2. Call a matrix A sketo-symmetric if A' — — A. Its determinant is zero 
if of odd order. Extend Theorems 15-17 to skew-symmetric matrices. Using 
Theorem 17, prove that the rank of a skew-symmetric matrix is always even. 

3. If A is a real skew-symmetric matrix, P — iA is Hermitian. Hence give 
a more direct proof of Ex. 2. The characteristic equation j A — Xi* | = 0 of any 
real skew-symmetric matrix has only purely imaginary roots. 

1 For elementary, direct proofs, see Dickson, Wedderburn, and Bliss, Annals of 
Math., (2), 15, 1913, 27, 29; 16, 1914, 43. 
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4. By an aUemate bilinear form is meant 


n 

a = S akiXkVh *= 

&, j-i 


Show that, if an 9^ G, 

n 

X2 = ai2“H-X^2 “ S aiiXj), Xi = X< (^ = 1, 3, 4,..., w) 

7-3 

and the cogredient transformation on the replace ct by Xi Fa — Fi ^2 + 4 >i 
where 4 > is an alternate bilinear form in =* 2,..., w). Treating 4 > as 

we did a, prove that any alternate bilinear form a with coefficients in any 
field F is equivalent under cogredient non-singular transformations with 
coefficients in F to 

^V2 — fc & r?4 — S 4 9?8 + • * • + laa-l 172 s — §2a 

5. By an alternate Hermitian bilinear form is meant a form a in Ex. 4 in 
which now a/* = — Si/. If its matrix is A, show that B = is a Hermitian 
bilinear form. What is therefore the canonical form of a? 


Theorem 18. If M is a symmetric or Hermitian matrix of rank 
r (r > 0), tya may derive a regular symmetric or Hermitian matrix 
A — (flif) by a permutation of the rows of M and the same permuton 
tion of the corresponding columns such that no consecutive two of the 
numbers 


(25) Pq == 1^ 33i = ail, p2 


axi 

ai2 


,Pt = 

On • 

■ “ Oir 

Chl 

022 

> • • ' 

Orl • 

• • arr 



are zero, where the pi are principal minors and pr 0. 

Postponing the case in which all the elements of the diagonal of 
M are zero, let the tth element be not zero. After interchanging the 
first and tth rows and first and ith columns of M, we obtain a sym¬ 
metric or Hermitian matrix Mx having an 9 ^ 0. 

Postponing the case in which zero is the value of every two- 
rowed principal minor, containing an, of ilfi, letj be such that the 
two-rowed minor obtained by deleting all rows and columns 
except those numbered 1 and j is not zero. After interchanging the 
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second and yth rows and second and jth columns of Mi, we obtain 
a symmetric or Hermitian matrix M 2 having an 5 ^ 0, p 2 — 
Qn 0^22 — Q 21 ^ 0. 

Consider those three-rowed principal minors of M2 which have 
P 2 as a first minor and proceed as before. 

In this manner we will ultimately obtain a symmetric or Her¬ 
mitian matrix Mr in which no one of the numbers (25) is zero 
unless we meet one of the postponed cases. The general such case 
is as follows. Let each of the numbers pi = an, P 2 ,..., p* be 
not zero, where h <r, while zero is the value of every (^ + 1)- 
rowed principal minor having p* as a first minor. By Theorem 15, 
not every Qa + 2 )-rowed principal minor having pi as a second 
minor is zero, since otherwise the rank of M would be A < r. 
After a permutation of the rows numbered h + 1, h ■+ 2,... ,n 
and the same permutation of the corresponding columns, we have 
Pft+2 7 ^ 0, Pife+l == 0. 

To prove the theorem by induction, assume that we have 
reached a matrix B in which no consecutive two of the numbers 
Pi,..., p« are zero, while p, 5 *^ 0, s < r. If not all (s + l)-rowed 
principal minors (of B) having p, as a first minor are zero, we 
permute rows and also columns numbered s + 1,..., w and 
obtain a matrix with also p«+i 5 ^ 0. In the contrary case, we pro¬ 
ceed as in the preceding paragraph with fc == s and obtain a matrix 
with also Pa +2 ^ 0 , Pa+l “ 0 . 


39. Eronecker’s^ method of reduction. 

Theoeem 19, Consider a quadratic or Hermitian form 

n 

(26) Q = 0 = 2 a*,- an = S<,-, 

i, /-I 

of rank r (0 <r < n) with coefficients in (he fidd P. After a per¬ 
mutation qf tike a’s we may assume from Theorem 17 that | aj,-1 9^0 

1 Werke, 1,166, 357. Cf. Dickson, Trans, Amer. Math. Soc., 10,1909,129, who 
gave applications to modular invariants. 
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(j.,3 = 1,..., r). Then we may express the x’s in terms of new 
variables p’s by a linear transformation with coeffidenis in F of 
determinarU unity such that Xt — yt (i = r + 1,..., n) and smh 
that Q becomes 

r 

Vi Vi, 

<, j-i 

whose terms have the same coefficients as the corresponding terms of Q. 
According as Z denotes x or Z, write 2 = 2 or 1; then 

^ n). 

Since the determinant of these n linear forms is of rank r, Theorems 
10 and 13 of Ch. Ill show that (the first) r of them are linearly inde¬ 
pendent, while the remaining n—r are linearly dependent on those r*: 

(27) It =£‘=*‘77 (& = r + l,...,»), 

where the c^i are numbers of F, Consider the transformation 

n 

Xi = yi—'^ChiPk (j = l, 

(28) *"’^‘ 

Xk = yh (fc = r + 1,...,«) 

of determinant unity and the implied relations 

n 

(29) Xi ^ yi ^• * • > 

= Vk (ft — r + 1,. •., w). 

Hence 

dyh dyk dXk i-i dXi 

(ft =: y + 1, -.., w), 
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and, since Q = Q, 

^ = ^ _Vcfc.^ = 0 

^Vk {.1 ^X% dyjs dXk i«ai 

(fc = r + 1 ,,., n), 

by (27) and the relation obtained from it by placing the symbol 
over every number of it. Thus Q is transformed by (28) and (29) 
into a form Qi lacking y* and y* (fe = r + 1 ,..., n). Hence 
is derived from Q by writing Xi « y,*, Xi — yi (i = 1,..., r), 
iTft = fg* = 0 (Jc = r + 1,,n), so that Qi is the form given in 
the theorem. 

In the following two theorems, Aij denotes the cofactor of af,* 
in the determinant a of the form (26). 

Theorem 20. If Ann ^ 0, we may express the x’s in terms of new 
variables yi,, yn 6 y ct linear transformation with coefficients in F 
of determinant unity such that Xn ~ yn ond such that (26) becomes 

(30) ^ an Vi Vi + 7 — Vn Vn. 

i, J-1 

In proof, consider the form 

^ A ^ a , 

4 > = Oi,-a:,-a:,- - —a^,a^. 

/-I 

Its determinant is derived from a by replacing Onn by Onn — a/Ann 
and hence is equal to the sum of a and a determinant obtained from 
a by replacing the last row by 0 ,..., 0, — a/Ann- Hence the 
determinant of ^ is 

a “ Ann " Ot/Ann “ 0 . 

Thus is of rank n — 1. Theorem 19 therefore shows the existence 
of a transformation of the type specified in Theorem 20 which 
replaces 4> by 
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n-l 

Y/O-aViVi 

i, 

and Xn Xn by Vn Vn, and hence replaces (26) by (30). 

Theorem 2L If Ann = 0, An^i n-i = 0, An n-i ^ 0, we may 
express the x's in terms of new variables yi, , •,, Vnby a linear tram- 
formation with coefficients in F of determinant unity such that 
a?n-i = yn^i, Xn = and such that (26) becomes 

n—2 

(31) 2 ffli,- + gy»-\ Vn + VVr^ Vn-x, g = 

i, J-I 


Let 6 denote the detenninant obtained by deleting the last two 
rows and last two oolumns of a. Then, as in (24), 


(32) ctb 


■^n—1 n—1 
■^nn —1 


•^n—1 n 
■^nn 


•^nn —1 -^nn —1 


0 . 


Consider the form 




Its matrix is 

M = 


n 

“ ^ 3!y “ QXn—x Xj^ “• 


€, ;-l 




• • • ai n-l 

ain 

an^ll 

• • • An-i n-l 

ttn-ln ““ 9 

ani 

••• Unn’-l ^ 9 

Onn 


The determinant obtained by deleting the last two rows and last 
two columns is 6, which is not zero by (32) and is the pr of The¬ 
orem 16. Next, the principal minor of M obtained by deleting the 
last row and last column is Ann = 0> while that obtained by delet¬ 
ing the (n — l)th row and (n — l)th column is An-i n-i = 0 
[they are the only (n — l)-rowed principal minors having 6 as a 




QUADEATIC AND HERMITIAN FOEMS [CL IV 


first minor]. Finally, if d is the determinant of M, the theorem 
which gave (32) implies 




■Aft—1 n—1 

A* n_i + gh 


A„_i „ + .grj 

Ann 



where i = A„ „_i + gh, since 5 = &. Heplacing g by its value in 
(31) and using (32), we get 


An n—1 ^ — "h (fl^An n—1 u) — 0, t — 0. 


Hence db — 0. Thus d = 0. Hence Jlf is of rank n — 2 by 
Theorem 15. Hence there exists by Theorem 19 a transformation 
of the type specified in Theorem 21 which replaces f by 

n~-2 

^ Vi Vh 

t, J -1 

and therefore replaces (26) by (31). 

If we apply to (31) the transformation 

Vi = a;i(i = 1,..., w - 2), = ojn-i - \gXn, 

Vn ~ (l/f7)®n—1 ~jr 

of determinant unity, we obtain 


n-2 

(33) dif Xi Xf “h 2Xn-^ iTn—1 “ ^QQXn iCnj Q == (lfS.nn-.l,» 

i, ?-l 

Let Q be a form (26) of rank r with coeflBicients in a field F. 
We can find by means of Theorems 19-21 a linear transformation 
of determinant unity with coeflicients in P which replaces Q by 

(34) gixixi+ - )rgrXrXr (Qi^gi^O),. 

in which the gr's are obtained explicitly in terms of the coefiicients 
of Q by formulas which differ in the various subcases neces- 
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sary to consider. We first pass at once to the equivalent form Qi 
of Theorem 19. 

(i) If not all (r — l)-rowed principal minors of the determinant 
a of Qi are zero, we may permute the x’s and assume that the co- 
factor Arr of ttrr in a is not zero. We obtain the equivalent form 
(30) of Theorem 20 with n = r, having the segregated term 
(a/Arr)^r 

(ii) But if all (r — l)-rowed principal minors of a are zero, 
the cofactor An of at least one element aij(i ^j) is not zero, 
since the rank is r. Permuting the a:’s, we may assume that 
A, ,_i 7 ^ 0 . We obtain the equivalent form (33) with n = r 
having the last two terms segregated. 

Treating the unreduced part (under the summation sign) as in 
(i)or (ii), we finally obtain (34) inwhich the coefficients are ex¬ 
pressed m terms of determinants of the matrix of the given form Q. 

For example, if the first of the two alternatives (i) and (ii) 
holds true at each stage of the reduction, we obtain the equivalent 
form (34) with the coefficients 


Pi, 


Ps 
—> • 



po 

Pi 

Po 

Pr-2 

Pr-1 


where po = 1 and the remaining p’s are the special principal 
minors (25) and each is not zero in the present case. 

40. Number of positive coefficients in the canonical form. By 
Theorem 18, the variables of Q may be so permuted that no con¬ 
secutive two of the numbers po, pi,..., Pr are zero, while Pr 0; 
call such a form regular. 

Let the field F be real when Q is a quadratic form. When Q is an 
Hermitian form, let F be a field obtained from a real field S by the 
adjunction of v*, where p is in iS and is negative. Then if a; = a 
+ by*, where a and 6 are in < 8 , a:5! = a® — pfe® is positive unless 
a; = 0. In either case, p,- = pi is real, while the coefficients of the 
last two segregated terms of (33) are of opposite signs. 
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Let P be the number of permanences of sign and V the number 
of variations of sign in the sequence po, pi, ..., Pr, with the agree¬ 
ment that a vanishing pi is counted as positive or negative at 
pleasure, but must be counted. 

Let p be the number of positive coefficients and n the number 
of negative coefficients in the equivalent form (34). 

If each Pi 7^ 0, these coefficients are given by (35). For each 
permanence (variation) of sign, two consecutive p's are of like 
(opposite) signs and their quotient is a positive (negative) coeffi¬ 
cient (35), and conversely. Hence p = P, n = F. 

This result holds true also when any pi is zero. For, pi-i and 
Pi+i are then each 9 ^ 0 (since Q is regular) and have opposite 
signs by (32) with n = i + 1, a = pi+i, 6 = pi^i, Ann = Pi. By 
the final two terms of (33), we now have two coefficients of oppo¬ 
site signs instead of the two fractions (35) which involve pi. We 
have P = 7 = 1 for the sequence Pi-i, pi = 0, pi+i whose 
signs are + ± “* or — + + by the agreement concerning zero. 
In other words, by ignoring the permanences and variations of 
sign in this sequence of three terms, we reduce P, V, p, and n each 
by 1. After all such deletions due to vanishing pi's, we saw that 
p — P, n = V, Hence the latter hold also before the deletions. 

Theorem: 22. Within any real field, any quadratic form of rank r 
is equivalent to one of type gix^^ + - • + Within a field 

obtained from any real field S by the adjunction of the square root of a 
negative number of S, any Hermitian form of rank r is equivalent to 
one of type giXiXi+ - • + grXr Xr, with each gi in S. Then the 
number of positive g^s is the number of permanences of sign in the 
sequence po, pi,pr, no consecutive two of which are zero, as may 
be assumed after a suitable permutation of the x*s. 

For example, let Q = 4* Then po = 1, = 1, pa == 0, 

p8 = —1, P 4 = — 1, P = 3, 7 = 1. Since Q is equivalent to + 2 
+ X 4 ^, we have p = 3, n = 1, 
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THEORY OF LINEAR TRANSFORMATIONS, 
INVARIANT FACTORS AND ELEMENTARY DIVISORS 

Weierstrass^s elementary divisors and the related invariant 
factors are here introduced in a natural manner in connection with 
the classic and a new rational canonical form of linear transforma¬ 
tions, respectively. 

41. Rational canonical form of a linear transformation.^ Let 
... yWn be independent variables. Let S be any linear trans¬ 
formation 


S: Wi=^aijWi (i = l, 


with coefficients in any field F. We do not exclude the case in 
which the determinant of S is zero, so that the matrix of the coeffi¬ 
cients of S is perfectly general. 

Let be any homogeneous linear function of ..., Wn whose 
coefficients belong to F and are not aU zero. Let X\ denote the 
corresponding function of TVi,..., TVn. If Xi = cri, where c is 
in jP, we have (1) for a = 1, with the understanding that the final 
equation alone is retained. But if Zi cxi, we write x^ for Xi. 
Let Xa denote the same function of the Wi that x^ is of the Wi. 
Then if Xa is a linear function of Xi and ra, we have (1) for a — 2; 
in the contrary case we write xz for Xa and have three linearly inde¬ 
pendent functions Xi, Xz. Proceeding in this manner, we obtain 


^For the case in which the determinajit is,not zero, S. Lattes, Annales Fac. Sc, 
Toulouse, (3), 6,1914, 1-84. Also W. KruU, Uber Be^eitmatrizen und Elementax- 
teilertheorie, Freiburg thesis, 1921. In Leipzig. Berichte, 69, 1917, 326-35,^ G. 
Xowalewski employed points, instead of linear functions, and obtained a canonical 
form whose matrix is the transpose of that in the text. The text presents the theory 
as found independently by the author several years ago. 
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a chain of linearly independent functions iUi, __ Xa with 

coefficients in F such that S implies 

( 1 ) ^ OIjZ) X 2 ^ OSZf . ■*“ Xa) 1.^1) •• * ) 

where the bracket denotes a homogeneous linear function of 
Xij... yXa with coefficients in F. We call a the length of the 
chain, and Xi its leader. 

Theorem 1. By the introduction of new variables which are 
linearly independent homogeneous linear functions of the initial 
variables Wi with coefficients in the field F, any linear transformation 
S with coefficients in F may be reduced to a canonical form defined by 
(1) and 


( 2 ) Yx = y 2 ) F2 = ys, . . . , F5-1 = yh) Yh = bi,..., yh]] 


( 3 ) Zx — Zz ^ Zz^ , . , ^ Zc^x = Zc) Zc = [^1) • • •) Zc]] 

etc.f where a is the maximal length of all possible chains, b is the 
maximal length of a chain whose leader is linearly independent of 
Xx) • * * )Xa) and c is the maximal length of a chain whose leader is 
linearly independent 0/ aJi,..., oJo, yi,..., y^. 

If a = 1, ^ is Wf - auWi (i = 1 ,,.., n), and is in canonical 
form with n chains. Henceforth let a > 1 and select a function 
xx which is the leader of a chain of maximal length a, so that we 
have (1). If n = a, the theorem is proved. 

For n> a, let Ux be any homogeneous linear function of the 
initial variables Wi with coefficients in F which is linearly inde¬ 
pendent of tci,..., Xa> Let Ux denote the corresponding function 
of the W i. If J7i = [xx,,,. yXa, Ux\, we have (4) for 6 = 1. In 
thecontrary case, write 2^2 for77i. Then if U 2 = [i»i,..., Xa, ux, u^], 
we have (4) for 6 = 2. In the contrary case, write Uz for 1/2. 
Proceeding in this manner, we get 

(4) Ux — Uz, C/2 = ws,..., i7&«i = Uh, 17j, = Z + [ux ,..., Uh[, 
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where I = [tCi,..., ajJ, and xi,..., Xa, «i,..., are linearly 
independent. If 1 = 0, (4) is the desired result (2) with yi — Ui. 

Next, let Z ^ 0. Then I + [wi,..., U},\ is linearly independent 
of «i,..., Ub, so that Ui is the leader of a chain whose length ex¬ 
ceeds h. Hence h < a. Introduce the new variables 

yi = Ul-\- 2/2 = + /i> • • • j ^6 = ^6 + /&_!, 

where / is a homogeneous linear function of xi,, Xa with 
coefficients in F, and S replaces/by/i, /i by/ 2 , etc. We can choose^ 
/ so that (2) follows. 

To obtain the third chain, let vi be any homogeneous linear func¬ 
tion of the initial variables Wi with coefficients in F such that 
Xi,... ,Xtt,yi,, 2/6j vi are linearly independent. Let Vi denote 
the corresponding function of the TFi. If Vi = [xi,... ,Xa, 
2/i> • • • j 2/6) *’J) we have (5) for c = 1. In the contrary case, write 
Vi for Fi. Then if F 2 = [aii,..., *«, yi, ■.., Vb, wi, vi\, we have 
(6) for c = 2; in the contrary case, write for Fa. Proceeding in 
this manner, we get 

(5) Fi = »2, Fa = «8,..., Vc-i -Vc, Ve = l + m, 
where 

Z “ [^ 1 , . . . (, Xaj 2 /I) • • • ) 2/6]) 272 — Si Vx “j- ‘ * -j- S^V^. 

Here »!,..., Xa, yx,...,yb, Vx,...,Ve are linearly independent, 
and the «,■ are numbers of F. If Z s 0, we have the desired result 
(3) with Zi = Vi. 

Next, let Z ^ 0. Then Z + m is linearly independent of Vx,..,Ve, 
so that is the leader of a chain whose length exceeds c. Since »i 
is linearly independent of aii,..., Xa and since & is the maTrimuTn 
length of a chain whose leader is linearly independent of Xx,.., Xa, 
we have c < 6. Introduce the new variables 

1 The proof is like that concemmg (6) with the yi suppressed and z% replaced by 
yi, c by 5, and, beginning with (8), also h by a. We prefer to omit the details here 
and treat the more t 3 i>ical derivation of the third chain. 
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(6) 2l = t*! + /, = 1h f\, . . . , Zc = ve fo-l> 


wtere / = /o is a homogeneous linear function of Xi,. . Xa, 
yi, ■ ‘ • ,yb with coefficients in F, while S replaces / by fi, fi by / 2 , 
etc. Let Zj denote the same function of the Wi that z,- is of the Wi. 
Hence 

C 

(7) Zi = Z 2 == , -Zc-i = Zc ^ Si Zi, 

/ i-i 

where 


ll = 1.+ SifM, 

i-l 

We can choose / so that h becomes a homogeneous linear func¬ 
tion of Xi, yi(i = 1,..., c). The proof is simpler by successive 
steps. First, if i is the coefficient of Xa in h take / = — kxa^c, 
whence/c = — kxa, and h lacks Xa- Since (7) is of the form (5), 
we may therefore assume that I itself lacks Xa> Then if i is the 
coefficient of Xa-i in Z, take / = — fcCa-c-i, whence fc — tXa^u 
and ll lacks both Xa-i and Xa> The last of these steps employs 
f — gxi to delete Xc+i from Zi. Similarly, we can choose the 
coefficients of yh^cj 2/6-c-i,..., yi of / to delete yi, y^^i ,.. ., 
2/c+i from Zi. 

If the resulting function Zi is identically zero, (7) is the desired 
result (3). In the contrary case, write 

Z» = Zi aJi + • ‘ + Zc iTc+t-i + Wi 2/t + • • • + Wc 2/c+i-i 

(i = 1,..., 6 - c + 1). 

Since we proved that h is given by this formula with i = 1, the 
Z's and m^s are not all zero. Let Li denote the corresponding func¬ 
tion of the Z’s and F's, Hence 

Li = Zi+i 


( 8 ) 


^ 1, m m t , f} — C). 
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Evidently = 1,..., & — c + 1) are lineajly independent 
linear functions of Xj, yjij = l,...,b). By (7) and (8), the 
chain with the leader Zi has the successive terms 

C 

• • • j ^c} ^c+j ~ ^j* + 2!^ Si Zi^f^i (y = 1,.., p b — c +1), 
i=*l 

and possibly further terms. By (6), the 6 + 1 terms z are linearly 
independent. Since the leader Zi is linearly independent of iCi,..., 
Xa, this result contradicts the definition of 6. 

42. Theorem 2. In a canonical transformation the characteristic 
determinant of the partial transformation on any chain is divisible by 
that of the next chain. 

For definiteness let the chains be the second and the third, and 
let the partial transformations on them be 

(9) 7i = 2/2j 72 = 2/3,, 7&.-1 = 2/6; Yh =2^ &<2/iJ 

c 

(10) Zi ^ Z2, Zi = Zsj ... f Zc^i = Zc) Zc = 2!) 

Then c ^ 6. Consider also the linear functions 

(H) ^C+l ^ Zc) ^c-^2 = 7c+i, , , . , = Zf,, 

which may be computed in terms of Zij... ^ Zc hy the recursion 
formula 

c 

(12) ~ 2^ ^4 (i ~ 1,..., 6 *— c + 1), 

The chain with the leader Wi = yi Zi has the successive terms 
= Vi + ^i(j =!,...,&), which are linearly independent since 
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2 / 1 ,. •., 2 / 6 , . ■ •, are independent. For, if 2 + «/) 

= 0 , then ^lijyj + Z — O, where Z is a linear function of 
zi,...,Ze, whence each hj - 0, Z ~ 0. Since h is the majdmaJ 
length of a chain whose leader is linearly independent of «i,..., iCo, 
the chain led by loi is of length ^ 6 , and hence is of length 6 . 
WriteTT/= 7^ + .^,-. Then 


6 

176 = 2 2/4 + 2&+1 ® [«’l, • • • » M’6] 

t-1 

= 52 “’I = 2 + *<), 

whence 

h 

(13) 26+1 = 2 «<■ 


If c = 6 , then 25+1 =J^CiZi by (10) or (12), whence c,- = 6 ,-, so 
that ( 9 ) and ( 10 ) have the same characteristic determinant. 
Next, let c < 5. If from the difference of the two members of 

(13) we eliminate 26 +i, 25 ,..., 2 c^ in turn by means of ( 12 ), we 
obtain a linear function of 21 ,..., 2 <„ whose coefficients must all 
be zero. By a mere change of notation, we conclude that, if we 
eliminate X*,..., X“+^ in turn from 

6 

(14) X»+i 

i-1 

by means of 

(15) =i:c<X^+>-‘ (j = 1 , ..., 6 -c + l), 

{-.1 

we obtain a linear function of X,_, X® whose coefficients are all 

zero. These eliminations are evidently the successive steps in the 
ordinary process of division of (14) by 
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(16) X'+i - E Ci X*. 

i-1 


* Hence (14) is exactly divisible by (16). Removing the factor X 
from each, we obtain the characteristic determinants of ( 9 ) and 
(10) respectively, signs apart. For, if iV is the matrix of ( 9 ), its 
X-matrix iV — X7 is 



X 

1 

0 

0 

... 0 

(17) 

0 

~ X 

1 

0 

... 0 



62 

h 

64 

■ • • 65 —X 




whose determinant, called the characteristic determinant of Nj is 
(18) (- 1)^ (X^ - &i - 62 X ~ 63 X* --X^-i). 

Example 1. Let be TFi = awi + Zws, W2 - Wi, Ws aws. 

Employing the new variables Xi — vh, — Wi, y - ws — Wi, Xi - PTg, etc., 
we get 

(7 : Xi ^ Xtf Xz = hxi + axif Y ^ ay. 

Apparently we have a canonical form composed of a chain tci, xz and a chain 
y. But the characteristic determinant X® — a\ — & of the partial transforma¬ 
tion on xi and xz is divisible by the characteristic determinant a — X of 
Y — ay only when 6 — 0. If 6 «= 0, C is the true canonical form of S, since 
every linear function of OJi, Xz^ y, in which the coefficient of xi is not zero, is the 
leader of a chain of length 2; while, if that coefficient is zero, it is the leader of 
a chain of length 1. If 6 0, take 

Zi — + 2/, =5 Xi + F — a;2 + ^ 2^2, 

Zz Xz “1“ cbY 

The determinant of the coefficients of Xif Xz, y is 
when 6 0 is therefore 

Z\ — zz) X2 = zs, Xs — — ahzi “h (6 — a®)z2 “f* 202^5. 

Example 2. There is a single chain, say with the leader xi — Wi ^ in the 
canonical form of 

Wi — wz-^- Ws, Wz-Wi-^- Wz, Ws ^ Wi + Wz Wz. 


= hxi + axz-h a^V ^ Zs. 

— 6. A true canonical form 
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43. Theorem 3. J/ di,..., d& are the characteristic determinants 
of the partial transformations on the variables of the firsts • • • > hth 
chains of a canonical linear transformation whose \-matrix M has 
n raws, and if gi denotes the greatest common divisor of all {n — 0“ 
rowed determinants of M, then 

gx ^ d^ •. • dtk, g2 ^ ds • * • dj, • • •) gk—i ~ d/js, g j ^ 1 ^ ^)* 


We employ the condensed notation 


Jkf- 


Ai 0 0 

0 As 0 


0 0 
0 0 


0 0 0 ••• 0 Ah 


where Ai, ..., A jt are the X-matrices of the partial transformations 
on the variables of the first,..., Ath chains of lengths a, ft,..., e. 
For example, A 2 is given by (17). Their determinants are 
di,..., d*. Let aij, bij, ... be the elements of Ai, A 2 ,... in the 
fth row and jth column. 

The minor Jkf' of an element in one of the 0 matrices of M is 
zero. For example, let M' be obtained by deleting a coltjmn con¬ 
taining elements of Ai and a row not containing elements of Ai. 
Evidently every a-rowed minor formed from the first a rows of M' 
has a column of zero elements. Hence Laplace^s development of 
M' according to the first a rows gives M' = 0. 

The minor of 61 in (17) is 1 . Similarly, the minor of Oai in Ai is 
1. Hence the minor of dai in Jf is gri = dfe • • • djfe, while the minor 
of any is evidently a multiple of gx. The minor of any ha is a 
multiple of di ds d 4 • • • d* and hence of gx, since di is a multiple of 
di by Theorem 2. The minor of any c^ is a multiple of dx dk d 4 . 
• • • dk and hence of gx- In this manner we conclude that gx is the 
greatest common divisor of all (n — l)-rowed minors of M. 

Any (n — 2)-rowed minor is zero if it is obtained from M by 
deleting two rows and two columns crossing in elements of a zero 
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matrix; the proof is the same as for M' above. The minor obtained 
by t iftlfi tin g from M the row and column crossing in ttoi as well as 
those crossing in bbi is evidently ga = ds * • • dj, while the minor 
obtained by deleting the row and column crossing in any an as 
well as those crossing in any is clearly a multiple of g^. The 
Tninrir obtained by deleting from M the row and column crossing 
in any an as well as those crossing in any c.-,- is evidently a multiple 
of da di" • • • dk and hence of ga- 
Proceeding in this manner, we have Theorem 3. 

44. Invariant factors of a canonical transformation. By (18), 
we have 

dl == (— 1)“ Ini da — ( 1)* In-h . . . j 

d*= 

where In-k+i are polynomials in X in which the coefficient 

of the hipest power of X is unity. Write 

(19') Ift-k = 1,..., fx = 1. 

It is customary to call I,- the jth invariant factor of the X-matrix 
of the canonical transformation C. Hence, apart from sign, the 
invariant factors distinct from unity are the characteristic deter¬ 
minants of the partial transformations, one for each chain of C. 

Thboebm 4. If Jj is ffie jih invariant factor of the Arrowed 
Tvmatrix M of a canonical tran^ormation, then I,- divides I,+i. If 
Oj is the greatest common divisor of aU flowed determinanis of M 
chosen so that the coefficient of the highest power of \ in Gi is unity, 
and if Go =‘ 1, then 

(20) Gi = Ii li " ‘ Ii (i — 1> • • • > ^)> 

(21) I, — (j = 1> • • • j 

so that the G’s uniquely determine the Ts, and conversely. 
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The first statement follows from Theorem 2. Next, if we write 
dj — 1 when j > k, we see by Theorem 3 that (?«_,• = ±d<+i 
-• • for 0 < n. By (19) and (19'), dj — dz la-j+i for 

1 g j g «. Hence we have (20) and therefore also (21). 

45. Theorem 5. If in a linear transformation mth the mairix A, 
we introduce new variables defined by a non-singidar linear trans- 
formoMon with the matrix B, we obtain one with the matrix BAB~K 

Using the same notation for a linear transformation and its 
matrix, let 


n 



A: 

Xi ^'E/aijXj 

(i = 1,.. 

,,n), 

(22) 

B: 

n 

Vi - 'El bih 

(i = 1,. - 

.,ra). 

(23) 

B: 

Yi^Y^hikXi 

It 

.. ,ra). 


Since 117^ 0, we may solve equations (22), and obtain 

n 

Xi='^Cj,y. (j = l, ...,n). 

Elimination of the Xk and the between (23), A, and gives 

JSJLJB Yi ^ OrJsj ^29 Vs (z = 1, . . • , 7 l), 

46. Rotations and orthogonal transformations. We shall illus¬ 
trate the introduction of new variables by an important example. 
If Wi, -W 2 juWi are the coordinates of an arbitrary point W referred 
to given;rec±angular axes OwiyOw 2 , Owz, we seek the coordinates 
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Pi) Pii Pa of the point P derived from W by the rotation about ON 
through angle a counterclockwise when viewed from N toward O. 
Let Til, na, n* be the direction cosines of ON. Choose new rectan¬ 
gular axes Ox, Oy, Oz, whose direction cosines with respect to 
Ovh, Owi, Owz are Zi, h, Z 3 ; mi, ma, ms; ni, na, Jis, respectively. Re¬ 
ferred to the new axes, let W and P have the coordinates x, y, z and 
X, Y, Z, respectively. Since the axis of rotation ON is now Oz, we 
have 

(24) X = a: cos a — y sin a, F = a: sin a -H p cos a, Z — z, 
by plane analytics. By solid analytics, we have 

X = liWi + hw!i + kwa, wi = hx + mi y + nih 

(25) p = mi Wi -[- ma lOa -f ms lOs, toa = Za a: -{- ma p -[- na 2 , 
z = niWi + n^uoi + na m, vjz = It x + may + ngz. 

Let D be the determinant of the transfomoation T defined by the 
first triple of equations. The second triple defines the inverse 
transformation T~^, whose determinant is derived from D by inter¬ 
changing rows and columns. Hence DD = 1 , Z) = ± 1 . After 
reversing if necessary the positive direction of one of tlie new axes, 
we have Z) = -|- 1. Solving the second triple of equations for z 
and comparing the result with g = «i wi -H . -., we see that 

(26) Za m® la m 2 ” fii, Zi ms — Zs mi — — 712 , li m 2 — Za mi — tia. 

We must express transformation (24) in the new variables 
Wi, Wa, Wa defined by (25) and 

p,- = liX + mtY + UiZ (i = 1 , 2 ,3). 

In the latter insert the expressions (24) and then the values of 
X, y, z from (25). We get 
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By the properties of direction cosines of perpendicular lines, the 
coefficient of cos a is equal to — w,* or 1 — nt, according as 
j 7 ^ i or j = i. The coefficient of sin a is one of the values (26) or 
zero in the corresponding cases. Write 

(27) d = cos ai = sin (i = 1, 2, 3). 

Then cos a == 2d^ — 1. Thus the coefficient of Wi in p* is 

(1 — n^) cos a + = 2d^ ^ 1 + 2{ni sin ia)^. 

Next, for j i, the coefficient of w, is 


Ui n,-(l — cos a) + e^i Uk sin a = 2ni rij sin^ 

+ 2e,*i Uk sin cos 

where h denotes that one of 1, 2, 3 which is distinct from i and 
while 

= fi 23 = 631 — + 1, exz «= C21 = 632 “ 1. 

Inserting the values from (27), we get 

1,2,3 

= (2d^ — 1 + 2a^) Wi + S + 2eyt a* d] ti?y, 


or, written at length, 

pi= ( 2 ai^+ 2 cP—l)«)i+2(oia2-asci)t02+2(aia3+a2cZ)t03, 

(28) P2=2(aias+a3d)«>i+ (2a2*+2(i®-l)«)2+2(a203-aid)w8, 

Ps =2 (ctiOs—aad)u>i"t "2 (0303+01^)102+ ( 2 o 8 ®+ 2 £p—l)t 08 . 

Theorem 6. Forrmda (28) represents the rotation dbovi the ams 
ON with the direction cosines n^, n^, n^ trough angle a countercloch- 
wise , when viewed from N toward 0. 
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The values of d, ai, 02 , az are given by (27), whence ai^ + 

“h Gz^ -h = 1. 

Consider a real ternary orthogonal transformation 

T: yi = biiXi biiXi btz xz (i — 1, 2, 3), 

whose matrix 5 is of determinant + !• Thus B'B = 7. If 7) is 
the determinant of R — 7 and hence of B' — 7, then 

B'iB-I) = - (B' -I), \B'\D= -D, 22) = 0. 

The point N = (ni, n^j uz) will be unaltered by T if 

Ui = bii ni + ^ + biz Uz (i = 1, 2, 3), 

which may be written as homogeneous equations having the deter¬ 
minant 7) = 0. Hence there exist real solutions ni, nz, not all 
zero. Let T replace the point X = (ici, iczy xz) by 7 = ( 2 / 1 , yz)- 
Then 

yi-Ui^ buisci - Ui) + bi 2 {x% - 112) + biz(xz — ns). 

Hence ^( 2 /i — UiY = ^{xi — UiY, for the same reason that 
^y^ = for the orthogonal T. Hence if 0 is the origin, 
OY - OZ, NY = NX. Thus the triangles OXN and OYN are 
congruent. Hence X and 7 lie on a circle whose plane is perpendic¬ 
ular to ON 3 so that T represents a rotation about ON. 

Any orthogonal transformation of determinant -- 1 is evidently 
the product of one of determinant + IhyX — x,Y y,Z = — 2 . 
The latter represents a reflexion in the xy-ph.ne. 

Theorem 7. Every real Urnary orthogonal transformation repre¬ 
sents a rotation or a rotation followed by a reflexioUj according as its 
determinant is + 1 or — 1. 

CoROLiiARY. Every real ternary orthogonal transformation of 
determinant + 1 is represented by formula (28). 
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Cayley^ expressed the r? coefficients of the 7 ^-ary orthogonal 
transformation in terms of — 1 ) parameters; but his for¬ 
mulas do not include all orthogonal transformations, except as 
limiting cases. See Exs. 3--6 of §54. 

This subject is only a special topic of the theory^ of linear trans¬ 
formations which leave unaltered a quadratic or bilinear form. 

47. Theorem 8 . Let A and B he n-rowed square matrices such 
that the elements of A are 'polynomials in X, while those^of B are ind^ 
pendent of X, and | B [ 7 *^ 0. Then the greatest common divisor Dt{h), 
with leading coefficient unity, of all Irrowed determinants of AB is 
equal to that of A. 

Let dt be the greatest common divisor of all ^-rowed deter¬ 
minants of A. By Theorem 5 of Ch. Ill, Dt is divisible by dt- 
Since A is the product’ of AB by the same theorem shows that 
dt is divisible by The two results imply Bi — d^ 

Since a like theorem holds for BA, we deduce the 

CoROLiiAET. Let A, P, Q be ?tr-rowed square matrices such that 
the elements of A are polynomials in X, while those of P and Q are 
independent of X, and P and Q are non-singular. Then the greatest 
common divisor of aU Crowed determinants of PAQ is equal to that 
of A. 

48. In particular, let A be the X-matrix of any linear transforma¬ 
tion r, the matrix of whose coefficients may also be denoted by 
T, so that A = r — X7. By the introduction of new variables 
defined by a linear transformation with the non-singular matrix 
Bj T becomes a transformation C7, the matrix of whose coefficients 

'Scott’s Theory of Detenmnants, revised by Mathews, 1904,197-202; Pascal’s 
Determinants, §§47-50, which cites twenty writers on orthogonal transforma- 
tions. 

*Encyclop4die des sc. math., t. I, v. 11, 47S-520. Cf. Muth, Elementartheiler, 
1899, 160-179. For Hermitian forms, Loewy, Nova Acta Leop. Carol. AJkad., 71, 
1898, 379-416; extract in Math. Annalen, 50, 1898, 657-76. 
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may also be designated by C. By Theorem 5, (7 = Write 

jD *=* C — XT. Then D — BAB^K In view of the Corollary, the 
greatest common divisor of all i-rowed determinants of D is equal 
to that of A. This result may be stated as follows: 

Theobem 9. Let any linear transformation T become C by the 
introduction of any new independent variables. Then the greaiest 
common divisor Gt of all t~rowed determinants of the \-matrix of T 
is equal to that of C. 

49. Canonical form determined by invariant factors. Let the 

new variables be chosen so that C is a canonical form of T. Since 
the X-matrices of these transformations have the same Gt, they 
have the same which, by (19), uniquely determine 

the characteristic determinants of the partial transformations on 
the variables of the various chains of C, and hence, by (17) and 
(18), uniquely determine the matrix of C. 

Theorem 10. For any linear transformation T, the quotient 
Gi/Gi^i is a polynomial in X called the jth invariant factor of 
the \~matrix of T. The matrix of the canonical form of T is uniquely 
determined by these invariant factors. 

In other words, any linear transformation has a single canonical 
form apart from the notation for the variables. 

50. Consider the transformation T defined by (1), (2), (3), 
etc., such that the characteristic determinant of (1) is divisible by 
that of (2), while the latter is divisible by that of (3), etc. Then 
T is its own canonical form, so that a,b,c,... have the properties 
stated at the end of Theorem 1. Hence we have 

Theorem 11, There exists a linear transformation with coeffir 
dents in any field F whose \-matrix has any prescribed invariant 
factors If with coeffidents in F such that If divides I,>i for every j. 
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51. Similar transformations. Two linear transformations (or 
matrices) S and T in F (i.e., with coefficients in F) are called 
similar in F if there exists a non-singular transformation 5 in F 
such that BSB-^ — T. In other words, S becomes T by the intro¬ 
duction of new variables defined by transformation B. Theorem 9 
states that the X-matrices of two similar transformations have the 
same Gt. 

Conversely, if the X-matrioes of S and T have the same Qt, 
then S and T are similar in F. Let U = QSQ-^ and V = BTR-^ 
be the canonical forms of S and T. Since the canonical matrices 
U and V have the same Gt, they are identical by Theorem 10. 
Hence 

= B-^VR==T, (B-^Q)S(,B-^Q)-^ = T, 

so that S and T are similar. 

Thboeem 12. Two linear tran^armations (or matrices) in F are 
similar in F if and only if their \-matrices have the same invariant 
factors. 

52. Invariant factors. Let M and N be any two rHcowed square 
matrices of which N is non-singular. Write A — MN-K Then 
M — W = (A— X7)iV has the same (?/ as A— Xl by Theorem 8. 
For the latter, = (rj/Gy-i is a polynomial in X by Theorem 10, 
and hence may be defined as the jth invariant factor of JIf — XAT. 

Removing the restriction that N be non-singular, let (?# denote 
the greatest common divisor of all j-rowed determinants of pM 
-t- trN, where p and <r are independent variables. Since each such 
determinant may be expanded according to the elements of a row 
(or column) and hence is a linear combination of certain C? — 1)- 
rowed determinants of pM -H a-N, Gf-i is a factor of every j-rowed 
determinant and hence of (?,. The quotient Gj/G,-i is therefore a 
homogeneous pol 3 momial in p and <r, and is called the jth invariant 
factor of pM + aN. 
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Exercises 

1. Xi = xt, Xj ■= aci, Xj = asj has the invariant factors X(X — a) and 
X — a. 

2. The invariant factors of 

■ 0 0 2 - X 

0 1 + X 2X 

, 2 - X 0 0 

are (X —;2)(1 + X) and X — 2. 

53. Classic canonical foim.^ For certain purposes a canonical 
form involving irrationalities is preferable to that in Theorem 1. 

Theorem 13. Start with ffie canonical form in Theorem 1, wiiJi 
coeffidenis in a fidd F, composed of partial tran^ormations of the type 

a 

(29) Xi = X2^ X 2 = X3, . . . I Xa-l = Xa, Xa = 23 

i—1 

Let the characteristic equation A(X) = 0 of (29) have the roots 
Ry S, Tf.,,, of multiplicities r, s,t,.,,, respectively» By the intro¬ 
duction of new variables Ui, Vi, Wi ,..., which are linearly independ¬ 
ent homogeneous linear functions of Xi,, Xa, (29) can he reduced 
to the classic canonical form 

{ Ui = Buif = Ruj + Uj^i (j = 2j..., r)f 

Vi = 8viy Vf = Svf Vj^i (j = 2j..., $)f 

Wi = Twif Wj = Twf + Wf^i (j = 2,..., 0,... 

Here the Ui (or Vi, Wi) are homogeneous linear functions of the x*s 
whose coefficients are polynomials in R (or S, T) with coefficients in 
F. If R and S are roots of the same equation irreducible in F,Vi may 
be derived from Ui by replacing R by S. 

^ Jordan, Traits des Substitutions, 1870,114-26, for transformations with integral 
coefficients taken modulo p of determinant prime to p. For any field, Dickson, 
Amer. Jour. Math., 24, 1902, 101-8; 
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Before giving a proof leading to explicit formulas for the Ui, Vi, 
, which imply (30) and their further properties, we shall verify 
a fortiori that (29) is similar to (30), when F is the field of all 
complex numbers, and hence can be reduced to (30) by the intro¬ 
duction of new variables Ui, Vi,.,.., formulas for which are, 
however, not found by this method. 

The \-matrix of the transformation in the first line of (30) is 


B-X 0 0---00 0 

1 12-X0--00 0 


0 0 o--oiie-x 


Let Aa and Ag be the X-matrices of the transformations in the 
second and third lines of (30). Thus the X-matrix of (30) is 


Ai 

0 

0 

0 ••• 

0 

As 0 

0 ••• 

0 

0 

As 

0 


The determinant of Ai is di = (22 — X) while the minor of the 
last element of the first row of Ai is imity. The determinants of 
As, As,... are ds = (jS — X)*, dt - (T — X)*,.... Then the 
determinant of Jfcf is A = di da ds..., while the first minors of M 
include ds d*..., di ds di..., di da di..., etc. Their greatest 
common divisor is 1, since it has none of the factors jB — X, 
5 — X,.... Again, the greatest common divisor of the first minors 
of the X-matrix of (29) is 1, since the minor of 6i in (17) is 1. 
Since the X-matrices of (29) and (30) therefore have the same 
invariant factors ±A, they are similar by Theorem 12. 

In other words, there exist new variables Ui, Vi,... whose intro¬ 
duction reduces (29) to (30). 

The properties of the %i, «<,... stated at the end of Theorem 13 
are important in various applications, but are not used in the 
present text. Some readers may prefer therefore to omit the fol- 


I 
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lowing proof of those properties in connection with a direct deduc¬ 
tion of (30). 

We again start with (29), in which the at belong to the field F. 
As in (18), the characteristic determinant of (29) is the product of 
(— 1)“ hy Co(X), where 

tt—• 1 

(31) C«_fc(X) s X»-* - S a.+fc+i (fc = 0, 1,..., a - 1). 

i—0 

Write Co (X) = 1 as the definition of (31) when fc = a. Then (29) 
replaces the function 

SO^) = 'E,XkCa-k(.'K) 

k-l 

by 

a 

Q(>-) =]S®jQ>- + 

J-2 

where 

a-j 

Qi = Ca-i+iO^) + cij “ 2 ai+j X* + ctj “ XCo-,*(X). 

i-O 

Hence 

(32) Q(X) — X/(X) ^ XC7a-i(X)] = 3 JiC7o(X), 

where the last reduction follows from the preceding result for 
j — 1. Byt differentiations with respect to X, we get 

(33) (X) - (X) - X/(‘> (X) s - Ca<‘> (X). 

Let jB he a root of Co(X) = 0 of multiplicity r, and write ui for 
f{R), and «(+i for the quotient^ of(22) by tl. Since the trans- 


iThe division is exact for each, coefficient. The entire proof is valid also when 
F is a so-called modular field in which the result of adding any element to itself p 
times is tiie element zero. An example is the field of the p residues of integers 
modulo p» 
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fonnatiou (29) replaces/(X) by Q(X) and hence replaces/^*^(X) 
by (X) for every X, it replaces Wi+i by (J2) -s- M, which we 
therefore denote by 17*41. Taking X = I? in (32) and (33), we get 

Ui = Bui, Vt-a — Rut+i + w* (< = 1,..., r — 1). 

Writings for f + 1, we obtain the first line of (30). By their origin, 
the Ui are linear functions of ®i,..., a;,, whose coefficients are 
polynomials in R with coefficients in F. 

It remains only to prove that the new variables are linearly 
independent. If 

r ‘ » t 

(34) s 0, 

j-i f-i j-i 

identically in xij..., Xa, we are to prove that the constants 
g, hy k are all zero. From the corresponding identity in the U, V, 
Wy we insert the values (30), subtract the product of (34) by B, 
and get 

2 Uj^i “ R)vj- +2 ^7 

7—1 7-2 

(35) 

+ J2ki(T — R)Wj+'^kjWi-i = 0. 

/-I 

If we can prove that the coefficients of Ui, v*, Wi in (35) are all 
zero, we shall have k* = 0 by w*, then kt^ = 0 by wt-i ,..., 
fci — 0 by Wi, and similarly every = 0 by Vi and every gt — 0 
(j > 1), whence (34) becomes gith = 0. But the coefficient of 
iSo in tti = f(R) is Co (22) = 1. Hence also gi = 0, so that all 
coefficients in (34) would be zero. ^ 

Since (35) is of the form (34), but lacks Wr, it suffices to prove 
that a relation of type (34) which lacks has every coefficient 
zero. The new identity (35) now lacks both m, and Ur-i. Taking 
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it in place of (34), we see by repetitions of this argument that the 
problem reduces to one involving only the »i and Wi. In the 
corresponding identity in V »and W <, we insert the values (30), and 
subtract the product of the former identity by S. Then as before 
we eventually obtain a relation involving only the Wi. This time 
we subtract the product by T. 

S4. Elementary divisors. In the classic canonical form (30) of 
(29), we saw that the characteristic determinants of the partial 
transformations, given by the successive lines of (30), are, apart 
from sign, 

(36) (\-Ry, (X-S)*, (X-DS..., 

whose product, apart from sign, is the characteristic determinant 
A(X) of (29), i.e., its unique invariant factor distinct from 1. 
These divisors (36) are called the elementary divisors of A(X) or 
of the X-matrix of (29). Recalling the definition (19) of the 
invariant factors of the X-matrix of any canonical transformation, 
of which our (29) is only one partial transformation, we obtain 

Theoebm 14. The elementary divisors of the \-matrix of any 
linear transformation are obtained from its invariant factors other 
than 1 hy taking the highest •power of each linear factor of each such 
invariant factor which is a divisor of the latter. Conversely, from a 
list of the elementary divisors written in any order, we may determine 
uniquely the invariant factors I j other than 1 by using the fact that I,- 
dtVndes J[ j+x* 

For example, let the invariant factors be 

h = 1, h = X, Is = X(X - 1)®, h = X®(X - 1)». 
Then the elementary divisors are 


X®, X, X, (X - D®, (X - 1)®. 
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Conversely, given the latter, we deduce as the product of the 
highest powers of the distinct Unear functions, then Iz as the 
product of the next highest powers, etc. 

The classic canonical form shows that there exists a Unear 
transformation on n variables whose X-matrix has any preassigned 
elementary divisors the sum of whose degrees is n. 

Our definition of elementary divisors applies only to X-matrices 
ikf — X/. In §52 we defined the invariant factors // of P - pM 
+ (tN, where M and N are any ?>-rowed square matrices. All the 
I i distinct from 1 may be expressed as products of powers of the 
same linear functions of p and o-, no two of which have propor¬ 
tional coefficients. Then those powers which have exponents > 0 
are called the elementary divisors of P. 

Exercises 

1. The three types of canonical linear transformations on three variables 
are 

U « Euj V ^ SVf TT = Tw\ 

XJi = 12ui, Uz ** Pwa + V *= Svi 

Ui = Ruif U 2 = Ru^ + Uif Ut Rv^ + ut, 

2. Find the elementary divisors of the X-matrix of each transformation in 
Ex. 1. 

3. Verify that formula (28) for any rotation in space is equivalent to the 
equation p — qwq^ in real quaternions 

q = d I -h oi i -j- 02 j + Os A;, tw « toi t *+• Wzj 4* Wi fc, 

p=‘Pxi + P2j + Pih, 

where q is of norm unity. See Ex. 6. 

4. The equation xym matrices 



defines a homogeneous linear transformation on a;i,..., Xi which evidently 
leaves invariant the quadric surface S defined by | a; 1 « 0; verify that it 
leaves invariant each of the lines 
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Lh *. * tel == te 

on 8, Similarly, ^ ^ yx leaves invariant each line 

h: — te, iC3 = kxij 

also on 8. Then^ ^ — zxy leaves 8 invariant, and permutes the lines L* 
amongst themselves and likewise the Z*. 

5. In Ex. 4 introduce new variables Xt and new parameters Yr such that 
8 becomes XX ^ « 0 : 

Xi Xt iXif Xi — Xi — iXif aj2 — — X 2 + iXs, xs = X 2 + 

yi-=Y4-iYt, 2/4 = Fi+tFi, y2 = Y2 + iY2, y, ^ -Y^^iY^, 

Then to xy corresponds S = XY in quaternions X = Xii + X 2 i 
■+* Xs fc + Z 4 Z. Thus if Z and Y are real quaternions, S = ZXY gives every 
real linear transformation of positive determinant which leaves XX ^ 
invariant up to a constant factor.® 

6 . Deduce from Ex. 5 the real orthogonal transformations on three vari¬ 
ables. Take X 4 = 0, S 4 = 0. In other words, take X + X' = 0, S + S' =* 0. 
Hence shaU ZXY = 7'XZ'. Write v for Z' Y^K Then p'X « Xp. The 
case X — i shows that the coefficient of ^ in p is zero. By symmetry, the cases 
X j and X = fc show that the coefficients of j and /b in p are zero. Hence 
p = rZ, where r is a real number. Thus Z' = rY and S = ZXZ' r~\ 
Taking norms, we see that the condition that Xi® + X 2 ® + X 3 ® be invariant 
is n(Z) = ± r, which is therefore positive. Write q for the real quaternion 
Z(± r)"i of norm unity. Thus S = ± ffXg'. The sign is in fact + or—, 
according as the determinant of the orthogonal laransformation is + 1 or 1. 
This gives a new proof of Ex. 3. 


® These transfoiniations and their products by (x 2 X 3 )t which interchanges the 
two sets of lines, are the only linear transformations leaving S invanant. Dickson, 
Bull. Amer. Math. Soc., 22, 1915, 53-61. 

®For an elegant proof using only quaternions, see Hurwitz, Zahlentheorie der 
Quaternionen, Berlin, 1919, 62-66r 
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PAIRS OF BniNEAE, QUADRATIC, AND HERMITIAN 

FORMS 

55. Histotical note. For a pair of bilinear or quadratic forms 
(the second being non-singular), the theory was first obtained by 
Weierstrass^ by developments in series, which introduce certain 
square roots. There was a serious gap in his proof. For pairs of 
bilinear fom^, all irrationalities are avoided by the method of 
Frobenius.® However, the resulting rational criterion for the 
equivalence of pairs of bilinear forms in any field may be proved® 
by simple modifications of Weierstrass’s method. We shall give 
here a still simpler rational method based on the theory of linear 
transformations. 

56. Equivalence of two pairs of matrices. Two pairs oi nXn 
matrices M, N and U, V, with elements in any field F, are called 
equivalent in F if and only if there exist non-singular w-rowed 
matrices P and Q with elements in F such that 

(1) PMQ = U, PNQ = V. 

We shall first treat the case in which N and V are non-singular 
(of. §57). Suppose that two such pairs are equivalent. Then 
MN-^ s= J" is similar to UV~^ =* W since 


P/P-i = PMQ • Q-^N-^P-^ = UF-i = W, 


^Berlin. Beric^te, 1868, 310; Werke, H, 19. 

* Jour, fiir Math., 86,1879,140-208 (see p. 202). For a report on this and related 
papers, see the author’s History of the Theory of Numbers, III, 1923, 284-8. 

_ ’Dickson, 'Trans. Amer. Math. Soo., 10,1909, 347-60. With the same modifica. 
tions, the method was extended to Hermitian forms by 'M. I. Logsdon, Amor, 
Jour. Math., 44, 1922, 254-60. 
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so that the X-matrices of J and W have the same invariant factors. 
By §52, the X-matrix J — yJ oi J has the same invariant factors 
as 

(J - Xl)iV = (ATJV-i - Xl)iV = M - XiV. 

Likewise, the ^matrix TV — XI of TV has the same invariant 
factors as 17 — XF. Hence M — \N and U — W have the same 
invariant factors. 

Conversely, the last result implies that the X-matrices of J and 
W have the same invariant factors, whence J and W are similar in 
F. In other words, there exists a non-singular matrix P in F such 
that PJP-i = W. Then 

P(J - XJ)P-» = IF - XI, 


P(MiV-i - XI)P'i = VV-^ - XI, 


P(M - \N)N-^P-^ = (H - X7)F-*. 

Writing Q for N~'‘- P~^ V, we get 

P(M - ■SN)Q = f7 - X7. 

By the terms free of X and those containing X, we get (1). 

Theoebm 1. Two pairs of nX n matrices M, N and U, V, with 
elemerUs in any field F, such that N and V are non-singular, are 
equivalent in F if and only if M — XiV and U — XF have the same 
invariant factors, or, if we prefer, the same elementary divisors. 

We evidently have the same criterion for the equivalence in F 
of two pairs of bilinear forms with the matrices M, N and U, F. 

S7. Next, we omit the assumption that the determinant | JV | of 
N is not zero, but assume that \M + xlV | is not zero identically in 
X. Thus we can select a number w 5 ^ 0 of P such that B = M 
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+ vN is not singular. Then if the pairs N and U, V are equivar 
lent in F, also the pairs M, B and Uj C = U + vV are equivalent 
in F, and C is not singular, and conversely. Hence by Theorem 1, 
the pairs M, N and U, V are equivalent in F if and only if M — XR 
and U — \C have the same invariant factors or the same ele¬ 
mentary divisors. The latter property holds if and only if the 
matrices pikf + (tN and pU + aV have the same elementary 
divisors, as will be proved by means of the following 

Lemma. IfA= pM + qN, B == uM + vN, where pv — gu 9 ^ 0, 
so that pA + tB — pM + <rN for 

(2) p = pfL + UTj a — qp + vt, 

then the elementary divisors of pA + rB are derived from those of 
pM + <rN by the transformation (2). 

Let mp + no- be a factor of all j-rowed determinants of pM 
+ (tN and let e be the exponent of the highest power of mp + n<r 
which divides all those determinants. The transformation (2) 
replaces any element of pM + a-N by the corresponding element 
oi pA + tB, and hence replaces any j-rowed determinant of 
pM -b (tN by one of jliA + tB. Hence if (2) replaces mp + ncr by 
ap + br, (ap + 6t)® is a factor of all j-rowed determinants of 
pA + tB, It is, moreover, the highest power of ap + br which 
divides those determinants. For if they were all divisible by 
(ap + hr) where g > e, we see by applying the inverse of trans¬ 
formation (2) that (mp + n<r)o would be a factor of all j-rowed 
determinants of pM + aN, contrary to hypothesis. The Lemma 
now follows from the definition of elementary divisors in §64. 

We may employ the Lemma when p — u = Ij q = 0, v 9 ^ 0, 
Then A — Af, 5 = M + vN. Hence the elementary divisors of 
PlM + tB are derived from those of pM + oN by the transforma¬ 
tion P = P + r, (T ^ VT. Thus, if C == U + vV, then pM + tB 
and pU + tC have the same elementary divisors if and only if 
pM + aN and pU + aV have the same elementary divisors. 
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Taking /x — 1, r — X, we have the notations employed in the 
discussion preceding the Lemma. This proves 

Theorem 2. Let M, JV, J7, V be any Tinrowed square matrices 
with elements in any field F such that neither of the determinants 
\pM + <rN\and\pU + ffV\ is zero identically in p and <r. Then the 
pair M, N is equivalent in F to the pair U, V if and only if pM + cN 
and pU + <rV have the same invariant factors (or the same elementary 
divisors). 

For brevity we shall speak of the invariant factors of pM + cN 
as the invariant factors of the pair of bilinear forms having the 
matrices M and iV*, and speak of the case in which | pM + o-AT [ is 
not zero identically in p and cr as the non-singular case. Then 
Theorem 2 may be stated in the following briefer form: 

Theorem 3. In the nonrsingidar case, two pairs of bilinear forms 
in Xi,..., Xn, 2 / 1 , ... , with coefficients in any field F are equiva¬ 
lent under non-singular transformations in F if and only if they have 
the same invariant factors. 

58. Canonical forms of a pair of bilinear forms. Let their ma¬ 
trices be M and N, where N is assumed for the present to be non¬ 
singular. The pair M, N is equivalent to the pair J = 

= I. If P and Q are non-singular and such that PJQ = W, 
PIQ — I, then Q = P”^. Hence the pair J, I is equivalent to the 
pair W, I if and only if PJP^^ = W, i.e., if J and W are similar 
matrices. 

When W is the matrix of the canonical form of the linear trans¬ 
formation of matrix J = we shall call W, I the canonical 

pair of M, N. 

When all the elements of M and N belong to a field F, we may 
choose W to be the matrix of the canonical form with coeflSicients 
in P of Theorem 1 of Ch. V. This proves 

Theorem 4, Any pair of bilinear forms in Xi,, x^ 2/i,..., 2/n 
with coefficients in any field F, such that the second form is non- 
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singidar, can be reduced by a nonrsingidar linear transformaMon on 
ffie z’s and one on the y’a, each with coefficients in F, to a unique 
canonical pair 


«-i ' 'i-i i-i ' 


a h 

“22/<+S«<«’<+•••» 

£*■1 £«*1 

where a + & + • • • = n. If there are h of the numbers ^the 

invariant factors of f — Xgr are 1,..., 1,, 

/«-! “ - i: X<-S 

i -1 {-1 

Since the at, bi,... are any numbers of F such that divides 
I i+i for every j < », we have the 

CoEOLLABT. There exists a pair of bilinear forms /, ^ in n + n 
variables with coefficients in F such that / — Xg has any preas¬ 
signed invariant factors I,- with coefficients in F such that 1/ 
divides I,+i. 

Next, if we choose W to be the matrix of the classic canonical 
form of Theorem 13 of Ch. V, we obtain 

Theoebm 6. Within the field of all complex numbers, any pair of 
bilinear forms in n-\-n variables, of which the second form is non- 
singular, can be reduced to the dassic canonical pair 

4> = +S + (^ 2 • • • I 

' i -1 M ' i -1 {-2 ^ 


r < 

®i + S i?i ^i +..., 

£-1 t-l 
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where r + s + •••=«. The denmdary divisors of ^ are 

C 0 HOLLAB.Y. There exists a pair of bilinear forms 4> and ^ in 
n + n variables, of which tp is non-singular, such that <l> ~ has 
any preassigned elementary divisors the sum of whose degrees 
is n. 

We now omit the assumption that | iV | 0, but assume that 

I pM 4- o-N I is not zero identically in p, <r. Choose u and ti in iP so 
that B = uM -t- vN is non-singular. Then choose p and g in jP 
so that p) — qu = 1, and write A = pM -f qN. Then 

(3) M = vA — qB, N = — vA pB. 

The pair of bilinear forms with the matrices A and B can be 
reduced rationally to a pair / and g of Theorem 4. The same trans¬ 
formations reduce the pair with the matrices M and N to the pair 
c = vf — qg and d = — uf + pg. We have pc + rd = pf + rg 
for 

(4) p = vp -Uff, T = —qp -1- po, 

which give the solved form of (2). The invariant factors of 
pf + rg reduce to those of / — Xg by writing p = 1, t = — X. 
Conversely, if we make the invariant factors of / — Xg homogene¬ 
ous in X, p, and then replace X by — t , we get the invariant fac¬ 
tors of pf -I- Tg. The transformation (4) replaces the latter by 
those of pc + (xd. This proves 

Theokem 6. In the non-singular case, any pair of bilinear forms 
in the variables * 1 , gi,..., y„, with coefficients in any field 

F, can be reduced by a nonrsingidar linear transformation on the x’s 
and one on the y’s, each with coefiicients in F, to a canonical pair 
c — vf — qg and d = — uf pg, where v, q, u, p are numbers in F 
such thal pv — qu = 1, while f and g are the forms in Theorem 4. 
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The invariant factors of pc + <rd are obtained by applying trans- 
formation (4) to those of yf + rg, viz., 

1 ,- 1 = 

;-0 

a~l 

= T® - 23 ^?+l 

If (X — iS)’’ is an elementary divisor of A — XB, then (r 
+ juB)'* is one of M + rB, and conversely. By the Lemma in 
§57, the inverse (4) of transformation (2) replaces an elementary 
divisor (r + plR)^ of M + rB by an elementary divisor (mi p 
+ ni cr) »* of pM + (tN. Since only the ratio of mi and ni is material, 
we may take miu + niv = 1. Then if J2 = mi p + ni we have 
mi p + «-! O’ == r + pR. By Theorem 5 we can reduce the pair of 
bilinear forms with the matrices A and B to the canonical pair 
<t> and Then that with the matrices (3) is reduced to the pair 
a ^ v<j) ^ q\l/ and jS — -- u<t» + whose expressions reduce to 
those given in Theorem 7. For, pt; — gw = 1, mi w + ni= 1, 
mi p + ?ii g = iZ imply mi = t;iJ — g, wi = p — uR; and, when 
we employ S instead of 22, we replace the subscripts 1 by 2 and 
hence have rriz — vS’— q^rh — 'p — uS, 

Theorem 7. In the non-singvlar case, any pair of bilinear forms 
inn + n variables can be reduced to the classic canonical pair 

/ «• r K ' 

a = {miYh 

^ <-l t-2 

+ Vi Vi + Vi + • • • I 

i«l i=-2 ^ 



§59] 


PENCILS OF BILINEAR FORMS 


119 


P = ^iXi 





+ (nzj^niyi — +- 

^ i -1 . <-2 *' 

The elementary divisors of pa + or/3 are 

(W2rl p "f* W-l C7) **, (7712 P “h ^2 <^) % • • • • 

Corollary. There exists a pair of bilinear forms a and ^ in 
n n variables such that pa + o-jS has a determinant which is not 
zero identically in p, cr, and has any preassigned elementary 
divisors the sum of whose degrees is n. 

Exercises 

1. Two qiiadratic forms with the matrices A and B have as invariants of 
index 2 the coefficients of the various powers of X in the determinant of 
A - X5. 

2. By means of Theorem 5, write down the three types of canonical pairs 
of bilinear forms when w = 3. 

59. Pencils of bilinear forms. Instead of a pair of bilinear forms p> 
and V, with coefficients in F, consider the pencil [fi, v] composed 
of aU bilinear forms p/x + where p and a- range over F. This 
pencil coincides with the pencil [a, /3], where 

( 5 ) a — PPL + qvj = upL + vv, pv — qu^ 0 . 

The pencil is called non-singular if the determinant of p/x + (tv 
is not zero identically in p, a*. 

A second such pencil, [pi, i^i], is equivalent in F to the pencil 
[p, v] if and only if the pair pi, vi is equivalent to some pair (5) 
of the pencil [p, v]. 

In view of the Lenuna and Theorem 2, we have 
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Theorem 8. Let M, N, Mi, Ni be the matrices of the bilinear 
forms n, V, Hi, vi with coefficients in any fidd F. The rum-singidar 
pencils [/t, v\ and [mi, n] are equivalent in F if and only if all &e 
invariant factors {or elementary divisors) of pi Mi + ai Ni can be 
derived from those of pM + cN by the same non-singular linear 
transformation expressing p and <r linearly in terms of pi and <ti with 
coefficients in F. 


60. The nth roots of a matrix. We shall need the 
Lemma. If a,b,c,... are distinct and not zero, and 


^{x) = (a: — a)>- {x - b)‘ {x — cY ..., 


ffiere exists a polynomial x(a:) such ffiat [x(a:)]" — « is exactly 
divisible by ^(a:). 

For example, ifra = 2andi^ = (a: - a)\ then 


X = §0"* (a: + a), x* - a: = ia-i (x - a)\ 


To prove the Lemma, consider the expansion (by the binomial 
theorem) of ^'^~x in a series of ascending powers of a; — on 


^1+ (a:-a)/a 
= -^[l + 5.^ + (j— 


na 


2n^a^ 


+ 


■]- 


and wnte F(x) for the sum of the first r terms, so that V® 
- F(a:) is termwise divisible by (a: - a)>-. Similarly, let C?(aj) be 

the sum of the first s terms jDf the expansion of in a sAriAp of 
powers of a; — 6, so that — G(a:) is divisible by (x — 



§60] 


THE iVTH ROOTS OF A MATRIX 


121 


By decomposition into partial fractions (or by expansion in a 
series of ascending powers of a: — a), let 


Fix) 


Ao 

(a; — o)’" 


+ * • • + 


Ar—l 

X — a 


+ 12(a:) 


A(x) (z — ay P(x) 
(x — a) ^ i/f(x) 


where the A{ are constants and R(x) is the quotient of a poly¬ 
nomial P(x) by ^{x)J{x — ay. Multiplication by ^{x) shows 
that the first r terms in the expansion of ip{x)A{x)/{x — a)' 
in a series of ascending powers of a; — a have the sum F{x) and 

hence are the same as the first r terms in the expansion of 

In other words, the difference between this fraction and is 
divisible by (a: — ay. 

Determine B{x) from Q{x) just as we determined A (x) from 
F(x). Then the difference between ^Tx and the second term of 


^;'(a;)A(a;) . 4>{x)B(x) , 

-«)'■+ (x-b). + 

is divisible by (a; — 6)*. The first, third,... terms are divisible 

by (a;— 6)* since \p(,x) is. Hence x — ^ is divisible by 
(a; — 6) * and similarly by (a: — a) , and hence by their 
product (a;). 

TTron -RBiivT 9. If X is a non-singular square matrix, there exists 
a polynomial x(X) suck that the rdh power of the matrix x(.X) 
is X. 

If ^ (X) is the characteristic determinant of X, then <}> (X) = 0 
by §25. Since X is non-singular, the constant term of <f> (X) is not 
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zero. Hence we may employ <l> (X) as the ^ (X) of the Lemma and 
conclude the existence of polynomials x(X) and q(\) such that 

IxMY - X = <A(X)^(X), 

identically in X. Since the coefficients of like powers of X in each 
member are equal, and since we conclude that 

[x(X)]»-Z = 0(X)g(Z) -0. 

61. Equivalence of pairs of quadratic or Hermitian forms, or 
symmetric or Hermitian biUnear forms. We saw in §56 that the 
theory of equivalence of pairs of bilinear forms may be developed 
rationally in any field. But this is not true of pairs of quadratic 
forms, in the investigation of which we shall employ Theorem 9 
and hence introduce irrationalities. Whatever method be em¬ 
ployed, irrationalities cannot always be avoided, as is clear from 
the following example. The real quadratic forms — 2y^ and 

— 32/^ are transformed into and respectively 

by the transformation x ^ z,y — (— 1)^ ^t?, but by no real trans¬ 
formation, since the first forms have real linear factors while the 
second forms do not. We shall therefore study equivalence with 
respect to the field of all complex numbers. 

Consider four bilinear forms a, jS, y, d whose matrices A, 5, 
(7, D all have the property X' - By Theorem 7 of Ch. Ill, 
the pair a, p is equivalent to the pair y, d imder a non-singular 
transformation on the x^s with matrix E and the transformation 
on the 2/^s with matrix E if and only i£ 

(6) C-^E'AE, D^E'BE, 

According as Si denotes X or X, our forms are symmetric or 
Hermitian bilinear forms. If we identify 2 /,-with r,* for j —1, ..,n, 
we conclude that two pairs of quadratic or Hermitian forms are 
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equivalent if and only if their matrices satisfy (6), where S — E 
or ^ in the respective cases. 

Thbohem 10. If the four n-rowed square matrices A, B, C, D 
have the property X' = X, auA if there exist non-singidar ro-rowed 
matrices P and Q such that 

(7) C = PAQ, D = PBQ, 

tten Oiere exists a non-singiilar iv-rowed matrix E for which equations 
(6) hold. 

■ By hypothesis, 

( 8 ) C =-C' I'P' = AP'. 

Equating this to C = PAQ, we get 

PA = AP'Q-K 

Write 

(9) X=&)-^P, 

so that X is non-singular. Then 

= P' Q~\ 

whence 

XA = A^', X^A = XAl' = A (X')\ 

By induction on k, we get 

( 10 ) X'‘A=A{X')K 

Let x(a:) = c* a:* be any polynomial in x. Multiply A = A 
by Co, and (10) by c* for A: = 1, 2,..., and add; we get 

(11) x(X)A = Ax(l'). 
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By Theorem 9, we may dioose complex numbers c* so that the 
square of the matrix V = x is equal to X. Since X is non¬ 
singular and F® = X, also F is non-singular. Then (11) gives 


A = F-Mf'. 


Thus (7i) becomes 

C = PV-^Af'Q. 


Write E = F'0 . From F* = Z and (9), we get 0' F = PV-K 
Hence C — E' A^. This proves the first equation (6). 

Since X and F and hence also E depend only on P and Q, but 
not on A or C, the second equation (6) follows similarly from the 
second equation (7). 

Theorems 10 and 3 evidently imply 

Theobbm 11. In the non-singuJar case, two pairs of symmetrie 
(or Hermiiian) bilinear forms are equivalent under a non-singular 
linear iransformaMon on the a^s and the cogredimt (or conjugate) 
trcensfmnoMon on the y’s if and only if they have the same invariarU 
factors. 

From this and Corollaries 1 and 2 of §§33, 34, we have 

Theoeem 12. In the non-singular case, two pairs of quadratic or 
HenmUan forms are eguivcdent under non-singidar transformaMon if 
and only if they have the same invariant factors. 

Consider the case in which the second quadratic or Hermitian 
form of each pair is ^ whose matrix is the identity matrix I. 

Hence Theorem 12 implies the 

CoEOiiLAET. Two quadratrc (or Hermitian) forms with the 
matrices A and C are equivalent under an orthogonal (or unitary) 
transformation if and only if the X-matrices A — \I and C — X? 
have the same invariant factors. 
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The conditions that two pairs of real quadratic forms shall be 
equivalent under real transformation are complicated.* 

ExnBciSES 

1. Every binary non-singular quadratic form is equivalent under an 
orthogonal transformation (not necessarily real) to one of the following: 

aa;2 -h &2/2(a 7*^ &); ax^ -h ay®; 2ax^ -f 2aixyf 
whose invariant factors are (a — X) (& — X); X — a, X — a; (a — X)®. 

2. In the non-singular case, two quadratic forms / and h can be reduced to 
forms involving only squares of the variables if and only if their elementary 
divisors are all of the first degree. Hence the latter is true by §37 when / is a 
real quadratic form and h is positive and non-singular. State the correspond¬ 
ing theorems for Hernutian forms. 

62. Pairs involving alternate forms. A bilinear form with the 
matrix A is called alternate if A' =* — A, and alternate Hermitian 
if A' = - A (cf. Exs. 4, 6 of §38). 

Theorem 13. Theorem 10 holds also when 

A' = - A, C = -€, 5' = ± 5, J5' = ± 5, 

For, (7i) then implies 

C = - = + Q'AP', 

SO that (8) holds also here. The rest of the proof of Theorem 10 
applies here unchanged. 

Theorems 13 and 3 imply 

Theorem 14. In the non-singular case, two pairs of forms a, p 
and 7 , d, of which a and y are alternate bilinear form {or alternate 
Hermitian forms), while and d are. either alternate or are symmetric 
{or Hermitian) bilinear forms, are equivalent under a norirsingular 
transformation on the x^s and the cogredient {or conjugate) transfor¬ 
mation on the y^s if and only if they have the same invariant factors. 

1 Muth, Jour, fur Math,, 128, 1905, 302-21. For pairs of binary quadratic forma 
in any field, see Dickson, Amer. Jour. Math., 31, 1909, 103-8. 
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If O' is a S 3 niimetric and p an alternate bilinear form, any ele¬ 
mentary divisor of pa + or/3 of one of the special types or 
is one of an even number of equal elementary divisors.^ 

63. Pairs of forms with preassigned invariant factors. 

Theorem 15. There exists a pair of quadratic (or Hermitian) 
forms and ^ with coefficients in the field F such that ^ is novr 
singular and the matrix of has any preassigned invariant 

factors Is with coefficients in F (or in the real subfield S of F)t where 
Is divides for every s. 

If there are t invariant factors distinct from unity, we shall 
prove that we may take <t) as, X) where as — X/S® 
has the single invariant factor Is distinct from unity, and where no 
variable of as or Ps occurs also in ai or /3» for i 9 ^ s (cf. the proof 
in §43). In the further discussion we shall omit the subscript s. 

For an odd number 2n — 1 of variables, we may take 

" n 2n-2 

a = ^ y an Xi Xi -f* 'y ) X2n—i 

n-l 

(12) ( +E (a i i^i Xi Xi+i + i Xi+i Xi), 

2n-l 

^ ^ V Xi X2n—i) a^i — ttij*. 

L i=.l 

We shall prove that the determinant An of a has the value 
A„ = (- +Za» X2-2 

+ 2 ^ (ai^i i + a^ 

<—2 

^Kjonecker, Monatsber. Akad. Berlin, 1874, 397; Werke, I, 423. Proof by Fro- 
bemus in Encyclop4die des Sc. Math., 1.1, vol. II, 463-9; and in Miith*s Elementar- 
theiler, 1S99, 135-142, 231-2. Stickelberger, Jour, fiir Math., 86, 1879, 42-43. 
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We have 


A„ = 


Oil 

On 

0 


0 

X 


Cm 0 0 

022 0 
032 O33 O34 


X 1 0 

10 0 


0 0 

0 - X 

X 1 


0 

0 


0 

0 


- X 
1 
0 


in which every element of the secondary diagonal is — X, except 
the middle element a„„ — X, while every element just below that 
diagonal is 1, and the further elements below it are zero. Hence the 
minor of ou is ± 1. Multiply the first row by 1/X and add to the 
second row; then multiply the first column by 1/X and add to the 
second column. Now — X is the only element 7 ^ 0 in the last row 
and last column. Hence A„ = — X® d, where d is the determinant 
obtained from A„ by deleting the first and last rows and columns 
and replacing 022 by 


hz = Os2 -h (on + a2i)/X + Uu/X®. 


We see that d is derived from A»_i by replacing every 0,7 by 
Oi+i) /+i except Ou, which is replaced by & 22 . lu a proof of the for¬ 
mula for An by induction on n, we assume the formula when n is 
replaced by n — 1. In the latter we make the preceding replace¬ 
ments and hence get d; its product by — X® is seen to reduce to 
the expression for A„. Finally, Ai = Ou — X is the determinant of 
(an — X)®! fi, to which a — X;3 reduces when n = 1. Hence the 
induction is complete. 

Hence the matrix « — X/3 has the invariant factors 
±An. By choice of the Oif, ±A„ can be identified with any given 
polynomial in X of degree 2n — 1 whose leading coefficient is 1 and 
the remaining coefficients g have the property §r = g. 

For an even number 2n of variables we employ 
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a 


t -1 


( 13 ) i 


n—1 

+ S »+l ^i+1 + <^i+l i ^<+1 ^() 

<-X 


2n-l 

+ S iC<+l ^n41-< + $211 + 

t -1 


2n 




i-1 


For n = 1, the determinant of a — X/3 is 


( 14 ) 


ttii fif X 
g - X 1 


— [X® - (sr + g')x + — au]. 


For n > 1, the determinant A of a — XiS is 

flu Ois 0‘”0 0 0 0 0 0--"0 0 (?—X 

Ojl 022 ^*23 •••0 0 ^0 0 0 0*'*0 —\1 

0 a«jaj8”-0 0 0 0 0 0 -X 1 0 


0 0 0 * • • On—iln—2 On—ln-4 On—In 0 ““X1'‘*0 0 0 

0 0 0 •••0 Om-l Onn —X 1 0 •••0 0 0 

0 0 0 ••• 0 0 —X 1 0 0 ••• 0 0 0 


0 -X 1 ••• 0 0 0 0 0 0 ••• 0 0 0 

cf-x 1 0---0 0 0 0 0 0---0 0 0 

Multiply the (n + l)th row by X and add to the »th row; than 
multiply the (n + l)th column by X and add to the nth column; 
we get a like dete rm inant having a„„ replaced by = Onn — X®, 
and having the terms — X in the nth and (n + l)th rows replaced 
by 0 . If n = 2, we have (16) with 622 = 022 — W 
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If n > 2, multiply the ?zth row by X and add to the (w — l)th 
row; then multiply the nth column by X and add to the (ra - 1 )th 
colunan; we get a like determinant Wving On-i »-i replaced by 

(16) 6 n_l n_i = On-i n-1 + X(fl 6 n_l n + n-l) + X® 6 „n, 

having Xb»„ added to both On n-i and On-i« (which does not effect 
our final result), and having the terms — X in the (n — l)th 
and (n + 2)th rows replaced by 0. Thus A is the negative of the 
determinant obtained by deleting the rows and columns numbered 
n, n + 1 , and n + 2 . However, we shall continue to number the 
rows as in A. If n = 3, we have (16), where 622 is given by (15). 

If w > 3, multiply the (n — l)th row by X and add to the 
(n - 2)th row; multiply the (n - l)th column by X and add to 
the (w — 2 )th column; we get a like determinant having an -2 «-2 
replaced by 

bn _2 »_2 = n _2 + X(a„_l »_2 + fltn -2 n-l) + X® 6 n-l »-l, 

having Xbn-i n— 1 added to both a»_i n_2 and On_2 n_i, and having 
the terms — X in the (n — 2)th and (n + .3)th rows replaced by 
0. Thus A is equal to the determinant obtained by deleting the 
rows and columns numbered » ~ 1 , ra, n + 1 , + 2 , « + 3 . 

Proceeding similarly, we see that 


( 16 ) 

Hence 


A = (- 1)»-® 


011 012 g — X 

(hi bsi 1 

g-\ 1 0 


A ( !)’*[ Oil "b oi2 g “b Cfei fif — (ffli2 "b 02i)X 

— 622(9 ~ 5 ^) (9 — X)], 

where 622 may be computed by means of the recursion formula 
6« = an + X(ai i+i + 04+1,) + X® 6 t+i 4+1 (i = 2 , ..., ra — 1 ), 
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and &„» = a»n — X*. Thus 

622 = — +2 X**“* +2 (ffli <+i + di+i i)X**"®. 

i =»2 i ^2 

Hence we obtain the expKcit formula 

(17) (- 1)" A = - (fir + + (gg — ann)X 2«-’2 

+ [“ Ctn-l n dn n-1 + (S^ + S')ctnn]X^”“^ 


n-1 

— 2 ■” (fl^ + S^) (®» *H + ») 

*=.2 

+ ggdi^i »-4.i]x®*“^ 

n —1 

— ^ [flti-i i + cii i_i + ggidi t*+i + tti+i i) — Ufi (fif + ?)]X^*"^ 
i >-3 

+ [— CH 2 — <*21 + (s^ + S^)<*22 “• gg (<*23 + <*32)]x 
<*11 + <*12 S ^ + <*21 fif — <* 22 ? \ 

where, by (14), only the first three terms are retained if n = 1, 
while if = 2 we retain the first three and last two terms, and 
omit the last part of the coeflicient of X. 

We can identify (17) with any polynomial X^" + • • • with 
coefficients in F or 5, according as x denotes x or x. This may be 
done by choice in turn of the letters written below the proper 
powers of X: 


X2»-i^ X2»“2^ X2«-«,..., X^^-i, X2^-2,... 

Q g^ **»«J <*n—1 n <*n n—1> ••• 3 dj <*j+l j, — djj, . . . 

where j takes values decreasing to 1 inclusive. We have here 
omitted from the complete coefficient of any X^ the terms involving 
only letters appearing under , X*'*'^ in the table. 
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64. Weierstiass’s canonical pair of quadratic forms. Whether 
the number of variables is odd or even, we have exhibited a pair 
a, j 8 of forms (12) or (13) whose invariant factors are 1,.. ., 1, 
where I is any preassigned polynomial. Omitting the signs - over 
X, we have the case of quadratic forms. Let 

k 

/ = n (X - x /)« i . 

j-i 

i 

If each Cj and each dj is not zero, the pair 

3—1 ^ i—1 i—l 

... 

6 — C j <^e i 

. y-i i-i 

has the invfiriant factors 1,..., 1,1. For, the matrix of y — X3 
has the abbreviated notation 


Ml 0 0 • • • 0 0 
0 M 2 0 • ■ • 0 0 


0 0 0 ••• 0 M* 


where 0 is a zero matrix, and M/ is the e,-rowed square matrix of 
those terms of 7 — X5 in which 3 has a fixed value: 



0 0 0 • • • 0 d, CjO^i — \) 

0 0 0 ■■■ df Cj{\i - \) 0 

dj (V (Xf - X) 0 • • • 0 0 0 

c# (X/ - X) 0 0 • ■ • 0 0 0 


The elements of the secondary diagonal are aU c,(X,. — X), those 
just to the left of them are aU dj, while all further elements are 
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zero. Apart from constant factors, the determinant of Mj is 
(X — X;)^? and the minor of the last element of the last row is 
= constant 0. As in the first part of §53, the invariant 
factors of r — Xd are 1,..., 1,7. Hence the pair 7 , d is equivalent 
to the pair a, 

In §63, we obtained a pair of quadratic forms 4 > = 2 
^ = ^^8} having exactly t prescribed invariant factors 7« 5 ^ 1, 
where oj, — X/S, has the invariant factors 1, ..., 1, In place of 
a,, we have now found that we may take a pair 7 «, d* of type 
(18). Hence 23d« have the t prescribed invariant factors 
7 *^ 1. Changing the meaning of k, we may now suppose that 
(X — X,)®/, for j — 1,..., k, give all the highest powers of linear 
functions which occur among all the invariant factors (instead of 
those for a single one I as before). Hence we obtain 

Theoeem 16. There exists a pair (18) of quadratic forms hiving 
any prescribed elementary divisors 

(X ““ Xi)®i, • • •, (X Xjfc)®^. 

For example, let the number of variables be three. 

(i) Cl = 62 = 63 = 1. Take ci = C 2 = 1, C 3 = — 1, and write 
aJ/foraJij-. Thus 


7 = Xi + X2 — X3 x^, a = — xz^. 

If the X^s are distinct, the conics 7 = 0 and d = 0 have four 
distinct points of intersection given by 

3 /1^ « X^ I X^ = X2 “ Xs I X3 Xi I X2 ““ Xi. 

If Xi 7 *^ X 2 == X 3 , the only intersections have a:i = 0, = ± 353 , 

and the conics are tangent at two points. If Xi = X 2 = Xs 7 ^ 0, 
the conics coincide. 

(ii) ei = 2, C 2 = 1. Take ci = C 2 = di == 1 , and write a;i, x^f ocz 
for Xu, x^L, Xu* Then 
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7 = 2Xi X 1 X 2 + + X 2 x^y d == 2xi X 2 + Xs^. 

If Xi = X 2 , the only intersection has xi — Xz = 0, and the conics 
have contact of the third order. But if Xi 5 ^ X 2 , there are two 
further points of intersection having 

Xx ^ 2 ^X 2 Xl)jr2j X^ ~ 4:(X2 Xl)il/2^j 

and the conics have simple contact at Xi — Xz ^ 0. 

(iii) Cl = 3. Take Ci — di = 1, and omit the second subscripts 
on the x'b. Then 

7 = Xi(2a7i Xz “h X 2 ^) “1“ 2 xi x^y d = 2 xi Xs d* X 2 ^» 

The conics have contact of the second order at xi X 2 = 0, and 
cross at X 2 = Xz — 0, 


Exercises 

1. List the canonical forms of pairs of quadratic forms in four variables.^ 

2. If A and iV are any two n-rowed square matrices, there exists an n-rowed 
non-singular matrix X such that X^A% = iV (whence A and N are con¬ 
gruent or conjunctive) if and only if the pair A, 2.'is equivalent® to the pair 
iV, N'. Hints: If they are equivalent, there exist matrices P and Q such that 
PAQ = Nj P£^Q = it\ Add and subtract and apply Theorem 13. 

65. Further applications of matrices to forms^ available in 
English. Hurwitz’s identity expressing the product of a sum of p 
squares and a sum of n squares as a sum of n squares.® 

^Bromwich, Quadratic Eorms, 1906, 46; or Hilton, Homogeneous Linear Substi¬ 
tutions, 1914, 105. 

^ Hence the problem depends upon the singular case. Muth, Elementartheiler, 
1899, 144r-51. Encyclop5die des so. math., 1.1, v. 11, 471-4. 

«Eor p - itj Dickson, Annals of Math., (2), 20, 1919, 160-4. Hurwitz, Math. 
Annalen, 88, 1922, 1-25. Radon, Abhand. Math. Hamburg XTniv., 1, 1921, 1-14, 
found the maximum p for a given n when the matrices are real. Generalization to 
forms of any order, Dickson, Congr^s International des MathSmatioiens, Stras¬ 
bourg, 1920, 215-30; Comptes Rendus Paris, 172, 1921, 636-40, 1262. 
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Fornis expressible as determinants with linear elements.^ 
Frobenius^s theory of matrices whose elements are polynomials 
a one variable; application to the equivalence of pairs of bilinea/i* 
Drms.^ 

Equivalence of two quadratic or Hermitian forms in a generail 

eld.3 

Singular case of pairs of bilinear, quadratic, or Hermitian forms-^ 
Weierstrass® gave a simple application of the theory of pairs of 
ilinear forms to the integration of a system of n linear differentia»l 
quations of the first order in n variables with constant coeflacients. 
For applications of elementary divisors to linear differential 
quations. see texts on the latter subject. 

1 Dickson, Trans. Amer. Math. Soc., 22, 1921, 167-79 (24, 1922, 185; 26, 1926, 
37); Amer. Jotit. Math., 43, 1921, 102-34; Annals of Math., 23, 1921, 70-74. 
2B6cher, Introduction to Higher Algebra, 1907,262-284. Of. Dickson, Algebraei 
id their Arithmetics, 1923, 169-174; revised in the German edition, Zutioh, 1926 • 

®Dickson, Bull. Amer. Math. Soc., 14, 1907-8, 108-115 (simplification of Trans- 
mer. Math. Soc., 7, 1906, 275-292); Quar. Jour. Math., 39, 1908, 316-33. 
^Kronecker, Berlin. Sitzungsberichte, 1890,1225-37,1375-88; 1891, 9-17, 33-44. 
mpler treatment by rational methods, Dickson, Trans. Amer. Math. Soc., 192T 
0.1909, 358-60). 

5 Berlin. Abhandl., 1875; Werke, n, 75-76. 




Chaptbe VII 

HBST PRINCIPLES OF GROUPS OF SUBSTITUTIONS 

The fundamental concepts of substitutions and groups will be 
introduced in a very concrete and natural way in connection with 
the solution of cubic and quartic equations. The reader will there¬ 
fore appreciate from the start some of the reasons why these 
concepts are employed. The deliberate presentation and numer¬ 
ous iUustrative examples will enable the reader to digest these 
somewhat abstract ideas. 


66 . Cubic equations. Of various methods of solving 
(1) a? + cx + d = 0 


we shall present the method which best illustrates our later theory 
of the solution of equations of any degree. If xi, xa, xt are the 
roots, then 


X ) *1 ^ + ®8 = - &, 

^XiXi = Xi.Xi-\- XiXi + XiXi = c, XiXzXs = —d. 
Let (i» be an imaginary cube root of unity, so that 


( 3 ) 


^2 ^ 1 -- 0, W® = 1. 


As soon as we have computed the values of 
(4) <f> = Xi+ CjOX 2 + rp ^ Xi+ X2+ COX3, 


we can find the x’s by solving these linear equations with the first 
one of (2). First, we add the three equations and get the value 


.135 
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of SiCi. Next, we multiply them by m®, «, 1, respectively, and add. 
Finally, we multiply them by ca, <o®, I and add. Using (3), we get 

Xi = ii<l> + ^(^ + miA — 6), 

% = i( + w* vt- — 6). 

To compute the values of 4 > and we employ 

= 2 »!*+(« + M*) ^ XlXz 

^ = (S a:x )* - 3 2 ®2i = -d = 1 ^ — Zc, 

4fl + ^ = 2Z ““ 3 X) ^2 + 12a;i X2 Xs 

= 2(2 “ 9Z) : J^XiOC2 + 27xx Oh Xs = B, 

^ ^ 5 = - 263 + 96c - 27d, 

- ^)2 (^3 4. ^ 3)2 _ 4^8 ^3 £2 _ 443, 


Hence <^>3 — ^ is equal to one of the square roots of — 44®. 
Choosing a particular square root and employing also (7), we get 

<^3 ^ + (52 _ 443)*] ^ 

^ ^ ^ = |[B - - 443)i] =. A. 

The cube roots and ^ ot g and A must be chosen so that 
<t>\p = A by (6). We agree to obtain ^ as a cube root of g and then 
compute 4' as the quotient A/<l>. 

If 0 is a particular cube root of g, the remaining cube roots are 
«0 and We therefore consider the functions 


(9) 


01 = CO0 == iC3 -j“ caXi CO® CC2) 
01 = CO® 0 = tCs + C 0 X 2 + CO® Xiy 


(10) 


02 = CO® 0 — ir2 4 " corCs 4 “ co® Xi, 
02 = CO 0 = iC2 + coa:i 4 " co® Xz, 


§66] 


CUBIC EQUATIONS 


137 


The replacement of 4 ) and \f' in (5) by u4> and respectively, 
has the effect of replacing Xi by Xs, by Xi, and Xt by Xi. This 
operation is called a substitution on the roots (or letters) xi, Xi, xa, 
and is denoted by (xiXgXi). Such a q/ch signifies that each letter 
is replaced by the letter written just to the right of it, while the 
last letter xa of the cycle is replaced by the first letter xi. 

Similarly, the replacement of 4> and ia (5) by t>)®<A and 
tayp respectively has the effect of applying the substitution 
(a:i Xa). 

If we had chosen the other square root of — 4A*, we would 
have obtained 4 ? = k, \l^ = g, instead of (8). The interchange 
of 4 and in (5) has the effect of applying the substitution 
(xiXa) which interchanges Xa with xa and leaves Xi unaltered. 
The replacement of 4 by and 4 by w*</> in (5) has the effect 
of applying the substitution (a;i Xa). Knally, the replacement of 
4 by t)^4 and by in (5) has the effect of applying the sub¬ 
stitution (XiXa). 

We may combine the essential parts of these results as foEows: 
If we have found a definite value of 4 (and hence of 4 = ^/4) by 
making a choice of the square root in (8) and then a choice of the 
cube root of the resiEting g, so that formulas (5) give definite 
values for xi, a^, Xa, then a different choice of the square root or 
of the cube root leads to new values of xi, xa, xa which are derived 
from the former values by applying a substitution on the roots. 
This condusion might have been anticipated, since if we start 
with the coefficients of the cubic equation and compute the roots, 
we are at Eberty to assign the notations Xi, Xa, xa to the roots 
arranged in any order. 


Exbbcisb 

Employing the linear functions (4), (9), ( 10 ), verify that the substitutions 
(xi xaxi) and (xi xt xa) replace 0 and 4 by 4 >i and 4 i, and 02 and 02 , respec¬ 
tively, while (xa xa) interchanges 0 and 4 , (*1 * 2 ) interchanges 0 i and 4 i, and 
(xi Xa) interchanges .02 and 4a. .Also, (xi Xa) replaces 4 and 4 by 02 and 02 , 
while (xi Xa) replaces 4 and 0 by 0i and 0i, respectivdy. 
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6'7. Discrimmaiit of a cubic equation. The discriminant A of 
a cubic equation in which the coefiScient of a? is unity is defined 
to be the product of the squares of the differences of the roots. 
We shall compute it by means of the value 5® — 4A.* of (<^* — 4^)* 
in §66. Since the cube roots of unity are 1, «, and we have 

z® — 1 s (s — 1) (z — «) (a — «*) 

for all values of z. Taking z — we see that 

4? — 4^ = (.(!> — 4') ~ (<^ — w® ^). 

From (4), (9), and (10), we get 

4 — 4 = (u ~ — Xs), 

( 11 ) 4> — 0)4 = {1 — (a)(xx —Xi), 

4 — ui^ 4 =s (1 — ft)®) (xi — a^). 

We desire the product of the squares of the three expressions 

(11) . By use of (3), we get 

(ft) — ft)®)® = — 3, (1 — ft)) (1 — ft)®) =» 3. 

Hence B® — 44® = — 27A. This gives 

(12) A = 18bcd-4b»d4-l)®c®-4c®-27d*. 

Exebcissss 

1. The discriminant ofa;* + «e + d = 0is—4c* — 27d*. 

2. Show that X* — 27x + 54 » 0 has a multiple root by computing its 
discriminant. 



§68] 


SUBSTirirnONS on n letters 


139 


3. If we define the discriminant A of ao;® + -h ca; + = 0 to be 

a* P, where P is the product of the squares of the differences of the roots, so 
that P is derived from (12) byreplacing h, c, d by h/a, c/a, d/a, respectively, 
show that A = 18a&cd — 4&® d + 6® c® — 4ac® — 27a® d®. 

4. If D is the discriminant of the corresponding cubic form + bx^V 
Cfl52/* + d 2 /®, show by (19) of Ch. I that A = — 27D. 

6. If A 7 *^ 0 and -A 0 (so that <;i» 0, ^ 0), verify by using (11) that 

the six functions (4), (9), and (10) have six distinct values. In the Exercise in 
§66, we gave a substitution which replaces by any one of the remaining 
five functions. The identity substitution which replaces each root Xi by itself 
does not of course alter 4>, Since therefore the six substitutions on the three 
roots replace <i> by six functions whose values are all distinct, *f> is called a 
six-valued function. 

6. Show that also ^ is a six-valued function. Why does it follow without 
computation that each of the four functions (9) and (10) is six-valued? 


68. Substitutions on n letters. The operation s which replaces 
iCi by Xaf Xz by rcb, . .. , by Xi, where a, 6, ... , Z form a 
permutation of 1, 2, . .., n, is called a mhstitution on xiyX^, . .. , 
Xn- It is given the two-rowed notation 


( 13 ) 


/XlXi ‘ ■Xn\ ^ /12 •••»\ 

\XaXh ■ ■ • xi) \ab • ■ • I /’ 


where in the second form only the subscripts are written. Since 
the order of the columns is immaterial, we may permute them in 
any manner without changing the meaning of s. 

To each piermutation a, b, ..., I oi 1, 2, ..., n corresponds one 
and only one substitution. Hence the nwmb&r of distinct siibstitur 
Uom on n letters is n\ = n{n — 1) • ■ • 3 -2 ■!. 

Instead of the two-rowed notation in (13), we often employ the 
one-rowed notation of cycles ( §66). The relation between the 
two notations will be clear from the following substitutions: 
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69. Product of substitutioas. Given a substitution s in (13) and 
another substitution t, we may permute the columns of t in such a 
way that the letters in its upper row are the same as the corre¬ 
sponding letters in the lower row of s. Then if 


( 14 ) 


t 


/ah 

\aP 



the effect of applying first s and afterwards t is the same as the 
effect of applying the single substitution 


(15) 


P - ( 
\a 


12 


n 

X 


). 


which is called the prodvuA of s by t We write p = sf. The word 
product is used here in the sense of resultant or compound. 

We may, of comse, find the product of substitutions each ex¬ 
pressed in cycles. For example, 


(12) (34)-(23) = (1342), (23)-(12) (34) = (1243). 


To find the first product, note that the first factor replaces 1 by 
2 and the second factor replaces 2 by 3, whence the product 
replaces 1 by 3; etc. Since the two products of our substitutions 
taken in different orders are distinct, we see that multiplication of 
substitutions is not always commutative. 

When a substitution is composed of two or more cycles (on 
different letters), it may be regarded as a product of commutative 
cyclic substitutions each composed of a single cycle. For example, 
(12) (34) = (12) .(34) = (34)-(12). 

Multiplication of substitutions is ossociaMve: st-v =‘ s -tv, so 
that the notation is unambiguous. Let s, t, and the product 
St — p have the notations in (13), (14), (15). Let 


/« 

\AB 


X 

L 


)■ 


I 



V 
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io 


/ ab 
\AB 



St - V — pv 


-( 


1 2 
AB 



= s -tv. 


From this result we readily deduce the associative law for prod¬ 
ucts of four substitutions a, b,c,d: 


a • bed = a[b • cd] = cib-cd — [oft • c]d = a&c • d, 


so that the notation abed is unambiguous. The associative law 
for products of any number lb of substitutions is readily proved 
similarly by induction on lb. Thus there is a single product of 
k substitutions oi, 02 , ..., a* taken in a fixed order, and the nota¬ 
tion ai 02 • • • Oft is imambiguous. In particular, if the k factors 
are all equal to a, their product is denoted by o* without ambi¬ 
guity and is called the Ath power of o. 

70. Identity, order, inverse. The substitution which leaves 
unaltered each of the n letters is called the idenUty and denoted 
by I. 

A substitution p is said to be of order (or period) A if A is the 
least positive integer such that p^ = I. For example, (12) and 
(12) (34) are of order 2. 

If p = (123 .. .r),p^ replaces 1 by 3, p* replaces 1 by 4, ..., 
and for j < r, p» replaces 1 by j + 1, so that the order of p is not 
less than r. Moreover, p’’ — I. Hence the order of a single cycle 
on r letters is r. 

If o, 6 ,... are single cycles of r, s, .. . letters respectively, and 
if no two of these cycles have a letter in common, then [ab •••]*» 
a* b* • • • will be the identity if and only if o* = /, b* = I, ..., 
and hence if and only if A is a common multiple of r, a, .... 
Hence the order of any substitution is the least common multiple 
of the orders (numbers of letters) of its component cycles. For 
example, (123) (45) is of order 6, while (12) (3456) is of order 4. 
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Every substitution s has an inverse such that ss~i = I, 
s-i s — I. This foEows from 


( 16 ) 




(ah 

W2 


I 

n 


)• 


Exercises 

1. If s ^ (123), verify that s* « (132), s* « /, s’l « 

2. If s is of order fc, 

3. A cycle is not altered by a cyclic permutation of its letters. 

4 If ~ sr, then f = r. 

5. The inverse of (123 • • • r) is (r r — 1 • • • 321). 

71. Quartic equations. To solve 

(17) + bx^ + cx^ + dx + e = 0, 

transpose the last three terms and complete the square on the 
first two; we get 

ix^ + _ c)x^ --dx-e. 

The device of Perran (1522-1566) consists in deriving an equa¬ 
tion in which also the second member will be a perfect square. 
This is accomplished by adding {x? + ^hx)y -f- to each mem¬ 
ber; we get 

(18) {x? + h^x + = (ib^ - c + y)x^ + (iby - d)a5 + - e. 

Denote the second member by q ^ rx^ + sx + t If it is the 
square of mx + n, then rn? — r, 2mn — s, — t, whence 
^ — 4tL Conversely, let the last condition be satisfied. If r = 0, 
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then s = 0 and g is a constant kid hence a square. If r 0, g is 
the square of r* a: + isr~i. Hence ra? + sx 1 is the square 
of a linear function of x if and only if s® = 4rf. Applying this 
result to the second member of (18), we see that it is a perfect 
square if and only if 

(19) 2 /® — cy* + Q)d — A.e)y + 4ce — 6* e — dP = 0. 

If 2/1 is a root of this resolvent cubic equation, (18) implies 

aP + i&a: + i2/i = ww: + « 

or aP + + §2/1 = “ ’w® ~ 

Let a;i and xs, be the roots of the first of these quadratic equar 
tions, and xz and Xi be the roots of the second. Then xi, ... ,Xi 
are the desired roots of (17). Note that 

= I 2/1 — Wj *8 ®4 = 42/1 + n, Xi ita + «5 a:!4 = 2/1- 

If instead of 2/1 we employ another root 2/2 or 2/s of the resolvent 
cubic (19), we obtain two quadratic equations different from (20), 
such that their four roots are a:i, Xi, xz, a^ paired in a new manner. 
The root which is paired with a:i is now Xz or Xi. Hence the roots 
of the resolvent cubic (19) are 

(21) yi = xi Xi + Xz X 4 , yz = xiXz + xg Xi, 2/8 = ®i ®4 + ®2 ais. 

The discriminant of (17) is defined to be the product of the 
squares of the differences of its roots. By (21), 

2/1 — 2/a = (®i - ® 4 ) (a^! — Xz), 

(22) yi-yz = (xi - xz)ixi — Xi), 

2/2 — 2/s = (®i — ®a) (®3 — * 4 ). 
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Taking the product of the squares, we see that the discnminant of 
a guartic equation (17) is eqical to the discriminant of its resolvent 
cubic (19). 


Exbecises 

1. The resolvent cubic ofa:^ + l « 0 is 2 /* — 42/=»0. Hence a:* + 1 has 
the pairs of factors a;* ± i, ± 2* a; + 1, a;* ± ( — 2)» a? — 1. 

2. The discriminant of a:^ — 8a?* + 22a;* — 24a; + 9 = 0 is zero, so that 
the equation has a multiple root 

3. If d = 6, c =* 1, so that (17) is a reciprocal equation, the resolvent cubic 
is (y — 2) [y* + (2 — c)y + 6* — 2c] = 0. The fact that 2 is a value of one 
of the y’s in (21) follows directly from the fact that the roots are reciprocal in 
pairs. In particular, taking y = 2, we find that a;* + a;* + a;* + a; + l has the 
factors a;* + i(l ± 5^)x + 1. 


72. Groups of substitutions. A set of m distinct substitutions 
is called a group of order m if every product of two of them and the 
square of each of them are substitutions of the set. 

For example, the n\ substitutions on n letters form a group, 
called the symmetric group on n letters, since the product of any 
two substitutions on n letters and the square of any one of them 
are substitutions on n letters. Again, if $ is a substitution of order 
m, the? substitutions J, s, ..., form a cyclic group of order 
m, generated by s. The identity group is composed of the single • 
substitution I, 

If all the substitutions of a group belong to another group, the 
former is called a subgroup of the latter. 

As a generalization, we shall next define a group whose elements need not 
be substitutions. An abstract group is a system composed of a set of elements 
a, 6 , ... and a rule of combining any two of them to produce their “product/' 
such that (i) every product of two of the elements and the square of each 
element are elements of the set, (ii) the associative law holds, (iii) the set 
contains an identity element I such that la ^ aJ ^ a for every element a 
of the set, and (iv) each element a of the set has an inverse belonging to 
the set, such that aa"^ s=i a = /. 
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In our definition of a group of substitutions, we employed assumption (i) 
alone. The associative law holds (§69). If a is a substitution of order m of the 
set, then a" — / and 0 ”“* >= o~‘ belong to the set. Hence for sets of sub¬ 
stitutions, properties (iii) and (iv) follow from (i) and (ii). 

73. Group leaving a function invariant. Let a ?4 be dis¬ 

tinct. By (22), the three functions yi = «! % -f- a;*, 2/2 = Jti + 
XiXi, ya = xiXi + X 2 Xa are distinct. Since ya and ya are the only 
other functions of the same form as yi, it is to be anticipated that 
each of the 24 substitutions on Xi, Xa, Xa, Xt replaces yi by yi, ya, or ya 
and hence that |24 or 8 of them leave yi unaltered. To verify 
these statements, note that a = (12) and b = (13) (24) leave 
XiXa + xaXi unaltered. The same is therefore true of the products 

c = = (1423), d = 6a = (1324), e = ca = (14)(23), 

he = (12) (34), a. 6e = (34). 

Hence yi is unaltered by the eight substitutions of the set 

Ge = {I, (12), (34), (12) (34), 

(13) (24), (14) (23), (1423), (1324)}. 

Evidently substitution (23) replaces yi by ya. Hence the eight 
distinct products of the eight substitutions of <?8 by (23) all re¬ 
place yi by ya. Sunilarly, the eight distinct products of the eight 
substitutions of Ga by (24) all replace yi by ya. There is no sub¬ 
stitution in common with two of these three sets of eight sub¬ 
stitutions, since they replace yi by yi, ya, ya, respectively. Hence 
together they give all of the 24 substitutions on Xi, xa, Xa, Xa. We 
conclude that evay substitution on the oi’s replaces yi by yx, ya, 
or ya, so that yx is a three-valued function. Moreover, we con¬ 
clude that Ga is composed of all the substitutions on the a;’s which 
leave yx unaltered. Hence every product of two equal or distinct 
substitutions of Gs is equal to a substitution of Ga, so that (J* is a 
group of order 8. 
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.We shall say that the function iCi a ?2 + iCs belongs to the group 
(?8 since it remains imaltered by all the substitutions of 0 $ and is 
altered by all further substitutions on Xi, x^, Xs, X 4 . 

We saw that aU eight substitutions of Gs may be obtained by 
successive multiplications starting with a and 6. Hence the 
group Gs is said to be generated by a and 6, 


Exercises 

1. Since 2/2 =» is derived from yi by mterchanging x% and a;a, 

the group Rg to which yz belongs is derived from by interchanging 2 and 3: / 

Rg = {/, (13), (24), (13) (24), (12) (34), (14) (23), (1432), (1234)}. 

2. By interchanging 2 and 4 in ft, obtain the group 

^g - {/, (14), (23), (14) (23), (13)(24), (12) (34), (1243), (1342)} 
to which yz ^ XiXi^ X 2 xz belongs. Find two generators of this group. 

3. For t ^ xiXz ^ Xz X4f we have — 4yi + &* — 4c. Hence a sub¬ 
stitution leaving i unaltered leaves also yi unaltered Show that i belongs to 
the subgroup {/, (12), (34), (12) (34)} of ft. 

4. The six functions obtained from = 2/1 + «V 2 -f ya by applying the 
six substitutions on 2 / 1 , yz, yz are all distinct if .. ., x^ are independent vari¬ 
ables. Then 4> belongs to the group Q composed of the substitutions on the 
x's which leave ^ 1 , ^ 2 , yz simultaneously unaltered. Hence Q is the greatest 
common subgroup of ft, Hg, J?g, so that 

G ^ {/, (12) (34), (13) (24), (14) (23)}. 

Verify also that (? is a commutative group. 

5. For three distinct a:^s, {xi — xz) {xi--Xz) {xz ^ xz) belongs to the 
group {J, a » (123), (132)}. Verify also that this is a cyclic group generated 
by a. 

74. Alternating group. A transposiUon is a substitution, like 
(13), which interchanges two letters and leaves unaltered the 
further letters. Every substitution is a product of transpositions, 

^And in further oases. See Ex. 5 of §67. 
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For, it is a product of.cycles on different letters, while a single 
cycle with n letters is a product of w — 1 transpositions: 

(123 • • • n -- 1 n) = (12) (13) • - • (171 - 1) (1 n). 

But the same substitution can be decomposed into transpositions 
in various ways. For example, 

(123) - (12) (13) - (13) (23) - (13) (23) • (46) (45). 

Of the various decompositions of a given substitution s into 
transpositions, either aU contam an even number of transpositions 
(and s is then called an even or 'positive substitution), or all con¬ 
tain an odd number of transpositions (and s is called an odd or 
negative substitution). This is proved by using the alternating 
function 

p « (OJi - Xz) (xi - Xb) (Xi - Xa) • • • {xi - Xn) 

■ — Xs) (Xi- Xa) • • • (a ?2 - Xn) 


• (Xn^l “ Xn)i 


in which aji, .,., aJn are assumed to be distinct. Any transposition 
(XiXj) with i <3 merely changes the sign of P (seethe preceding 
Ex. 6). For, the factors of P which involve neither Xi nor a;/ are 
evidently unaltered. The factor Xi-- Xj involving both is changed 
in sign. The remaining factors may be paired to form the products 

± {xi — Xk) {xj — Xk) [fc = 1, ... , n; fc 5*^ i, & 5*!^ j ]. 

Such a product is unaltered. Hence P is merely changed in sign. 

Hence if s is a product of an even number of transpositions, it 
leaves P unaltered. But if s is a product of an odd number of 
transpositions, it replaces P by — P. This proves the above asser¬ 
tion which justifies the terms even and odd substitutions. 
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All even substitutions on n letters therefore form a group, called 
the aMemating group on n letters. It is the group to which the 
alternating function P belongs. 

75. Theorem. If a group G contains an odd svhstitvMon t, exactly 
half of the svbstituiions of G are odd. 

For, the products of its even substitutions by t are distinct and 
odd, so that G contains at least as many odd substitutions as 
even ones. Again, the products of its odd substitutions by t are 
distinct and even, so that G contains at least as many even sub¬ 
stitutions as odd ones. The two results show that G contains 
exactly as many odd as even substitutions. 

It follows that the alternating group on n letters is of order 
I -nl. Since it is a subgroup of the s 3 nQ!imetric group of order wl, 
we have an illustration of the next theorem. 

76. Theorem. The order of any group is a multiple of the order 
of any subgroup. 

Let a group G of order r have a subgroup H of order s < r, so 
that G contains a substitution g2 not in H. Write Hgt for the set 
of s substitutions obtained by multiplying each substitution of 
H by ga. If a substitution hga of Hga were equal to a substitution 
h' of H, then ga = h~^ h' would belong to the group H, contrary to 
hypothesis. Hence there are 2s distinct substitutions in the set 
H Sgi) which denotes the aggregate of the sets H and Hga- 
If r = 2s, the theorem is proved. If r > 2s, (? contains a sub¬ 
stitution ga not in or Hga. As before, no substitution of Hgs 
belongs to H. If a substitution hga of Hga were equal to a sub¬ 
stitution A' ga of Hga, then ga = h~^ h' ga = hi ga would belong to 
Hga, contrary to hypothesis. Hence there are 3s distinct sub¬ 
stitutions in the set H -t- Hga + Hga. Unless the latter coincides 
with G, we repeat the argument using a new substitution ga. 

If (? is a group on n letters, its order does not exceed nl. Hence 
the above process terminates and gives 
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(? = J? + Hg2 + Hqz + • • • + Hqu^ 

Thus G is of order r = su. This integer u is called the index of H 
under 0. 

Exercises 

1. The order of any group on n letters is a divisor of nl, 

2. The order of any substitution of a group G divides the order of G, 

3. Hence a group of prime order is cycho. 

4. The alternating group on 4 letters is composed of I, (12) (34), (13) (24), 
(14) (23), and the eight cycles of three letters. 

5. All the substitutions common to two groups form a subgroup of each. 



Chapter VELI 

FIELDS, BEDUCIBLE AND IRREDUCIBLE FUNCTIONS 

This chapter presents certain concepts and elementary theorems 
which will be needed in the Galois theory of equations. 

77. Fields, adjunction. A set of complex numbers is called a 
nun^er field if the sum, difference, product, and quotient (except 
by zero) of any two equal or distinct numbers of the set are them¬ 
selves numbers of the set. Examples were given in §28. 

All rational functions of one or more variables with coeflSioients 
in a number field form a function field. In the Galois theory of 
equations, we employ a field jB (fci,,. -, km) composed of all 
rational functions with rational coefficients of fti, — , km, either 
all of which are constants, or certain of which are constants and 
the others are given functions of specified variables not neces¬ 
sarily independent. For example, 22(3*) is composed of ah 
numbers 

a -f &3^ 
c d3* 

in which a, 6, c, d are rational numbers and c and d are not 
both zero. Multiplying numerator and denominator by c — d3*, 
we see that 22 (3*) is composed of all numbers e -f- fS^ in which 
e and / are rational. Hence 22 (3*, x) is composed of all rational, 
functions of x whose coefficients are of the form e + /3* with 
e and / rational. 

The elements, whether numbers or functions, of a field are con¬ 
veniently called quantities. 

Altiiough the set composed of the number zero alone satisfies 
the above definition of a number field, we shall exclude it and 
hence agree that every field F contains a quantity q 9 ^ 0. Hence 
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F contains g/g = 1, 1 + 1 = 2, etc., and therefore contains all 
rational numbers. In other words, the field R of all rational 
numbers is a subfield of every field F. The foregoing field 
R (ki, ..., km) is said to be derived from R by the adjunction of 
kxf ... jhm to R. 11 n <m, R (fci, ..., km) is derived from 
RQzi,, kn) by the adjunction of ... ^km* 


Exercises 

1. If = — 1, every number of 22 (i) can be expressed in the form e + ‘ 

where e and / are rational numbers. 

2. 22(3*) = 22(12*). 

3. Writer = 3*i, « = — f 4* ir. Thea22(r) - 22(w) = 22(o)»), 

4. If 5 = 1 4- 2*, 22(t, 2*) - 22(«). Hint: t - 2* = - 3/s 

6. In Ex. 4, 22 (s) is derived from 22 (i) by adjoining 2*. 

78. Greatest common divisor. Let /(a;) and g{x) be poly¬ 
nomials in X with coefldcients in a field F, such that g{x) is not 
identically zero. Let the division of /(a;) by g (x) yield a quotient 
d(x) and a remainder r(a;) whose degree is less than that of 
g(a:). Similarly, let the division of g by r yield a quotient b and 
a remainder s whose degree in a: is less than that of r. Proceeding 
in this manner, let 

f ^ ga + Tf g = rh + s, r — sc + s = id, 

where the fourth remainder has been assumed to be identically 
zero, to simplify the notations. Since a, r, 6, ... were obtained by 
rational operations, all of their coefficients are quantities of F. 

Any common divisor of / and g is seen in turn to be a divisor of 
r, s, and t. By employing the equations in reverse order, we see 
that i is a divisor of s, r, g, and /. Since ^ is a divisor of / and g and 
since any common divisor of them is a divisor of t, we call t a 
greatest common divisor of f and g. It is determined uniquely apart 
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from a factor belonging to F; in particular, there is a single greatest 
common divisor having unity as coefficient of the highest power 
of X. 

The displayed equations give in turn 

r^f-ga, s = g{l + ah) - fb, 

f = /(I “h ho) — gF(a + c “h ahc). 

Theorem 1. Any two 'polynomials f(x) and g(x), not hath 
identically zero 'with coefficients in a field F, have a unique greatest 
com'mon divisor d{x) whose leading coefiicient is unity. All of its 
coefficients belong to F, and there exist two polynomials u(x) and 
V (x) 'With coefikients in F such that 

d{x) s u{x)f{x)+ v{x)g{x). 

The preceding discussion applies also when / and g denote 
integers. Let g be positive. Then the division of / by gr yields 
an integral quotient a and an integral remainder r such that 
0^ r < gr. Hence / and g have a unique positive greatest com¬ 
mon divisor. In case it is unity, we call / and g relatively pri'me. 
Hence we have 

Theorem 2. If f a'nd g are relatively pri'me integers, there exist 
integers u and v such that uf + vg ^ 1. 

79. Roots of algebraic equations. Consider an equation 

(1) f(x) ^ x^ + Cl x”'-^ + •' • + c« = 0. 

When ?^ = 3 or 4, we found formulas in Ch. VII which express 
the n roots of (1) in terms of the coefficients Ci, But when n > 4i, 
,no such formulas are known except for special equations like 

— c = 0. 
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What shall be meant by roots of (1) when the coefficients c, are 
independent complex variables? In that case the method ( §78) 
of finding the greatest common divisor of / (x) and its derivative 
f'(x) leads to a final remainder r(ci, ..., c„) which is free of x 
and is not zero identically. For, if f(x) s a;" — 1, then 
/' (a:) = nx"~^ and r = — 1. Let Ci, ... , Cn be constant values 
of Cl,..., c„ such that r(Ci, ..., On) 9^ 0. Then the resulting 
function (1) has no factor in common with its derivative, and 
equation (1) has n distinct roots. Since the polynomial r is a con¬ 
tinuous function of the c’s, we have r(ci, ..., c„) 0 for all sets 

of values of ci,..., c* for which each difference c,- — Ci is suffi¬ 
ciently small in absolute value. For each such set of values of 
Ci, ..., Cn, equation (1) has n distinct roots Xi, ..., Xn, and 
each Xi is known^ to vary continuously when the c’s vary con¬ 
tinuously. These numbers Xi are therefore the values pf a con¬ 
tinuous function aji of Ci, ..., c„ for those values of the c’s such 
that each difference Ci — Ci is sufficiently small in absolute value. 
These functions Xi, ..., Xn bxb called the roots of (1). In brief, 
the root values may be assembled into root functions. 

If the Ci are not independent variables, but are functions of 

certain variables «,■ such that r (ci,__ c«) is not zero identically 

in the v,-, the preceding discussion remains valid if we consider 
only sets of values of ci, ..., c« which are obtained by varying 
the Vj continuously. 

80. Redudbility. A polynomial in one variable x with coeffi¬ 
cients in a field F is called redttdbU in F if it can be expressed as a 
product of polynomials in x, neither a constant, with coefficients 
in F. It is irreducible m F if no such factorization is possible. 

For example, — 4 is reducible and a® — 2 is irreducible in 
the field B of rational numbers. But ar® — 2 is reducible in R (2*). 

According as / (x) is reducible or irreducible in F, the equation 
f(x) = 0 is called reducible or irreducible in F. 

^Weber’s Lehrbuch der Algebra, ed. 2,1, 1898, 148. Gf. Coolidge, Annals of 
Math., (2), 9, 1908, 116-8. 
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Theorems. Letf(x) and g(x) be polynomials with coefficients 
in afield F and letf (x) be irreducible in F, If one root a off(x) — 0 
satisfies g(x) == 0, thenf(x) is a divisor of g{x). 

The greatest common diyisor d{x) oif{x) and g(x) is not a 
constant, since it has the factor a? — a. By §78, the coeflSlcients of 
d{x) belong to F. The quotient affix) by d(x) is a constant, 
since other\\dse f{x) would not be irreducible in F. Hence 
d(x) = qf(x), where c is a quantity independent of x belonging 
to F. But d(x) divides g(x). Renee f{x) divides g(x). 


Exercises 

1. A root of an equation irreducible in F does not satisfy an equation of 
lower degree with coefficients in F. 

2. An irreducible equation/(x) ~ 0 has no multiple root. Hint: Consider 
f'M =0. 

3. Two equations each irreducible in F are identical if they have a com¬ 
mon root. 

4. + 1 is reducible in any field which contains'one of the four numbers 

2 if (—2)i, i = {— 1)^ p — (1 +i)2~i, but is irreducible in all other 
fields. Hints: Its linear factors are x ± p and x ± p"^, while its pairs of 
quadratic factors are ± i, x^ ± 2ix + 1, ± — 1. 

81. Gauss’s Lemma* If a polynomial f{x) with integral coeffir 
dents, that of the highest power of x bdng unity, is the product of two 
polynomials with rational coefficients, 

^(x) - x”' + bix”-^ + •••+&», 

4>(x) = «" + Cl a:»-i + •••+(!„ 

iJim these coefficients are aU integers. 

Let the fractions 6i, — ,im be brought to their least positive 
common denominator Bo and write hi = Bi/Bo. Then Bo, 
Bi,... ,Bm are integers having no common integral divisor >1. 


1 
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Similarly, let Ci — Ci/Co, where Co, Ci, ..are integers hav¬ 
ing no common divisor > 1. The theorem is obviously true if 
Bo = 1, Co = 1. Next, let Bo Co > 1. Multiplsdng the mem¬ 
bers of / s ^ ^ by i5o Co, wC get 

(2) Bo Cof(x) = gh, 

where 

g - Box'” + Bi a:“-i -f ■ ■ ■ + B^, 
h — Co x” + Cl X”~^ -|- • • • -|- Cn. 

Let p be a prime factor of Bo Co- Since p divides each coefficient 
of the left member of (2), it divides each coefficient of gh. Not all 
coefficients of g are divisible by p. Let B{ be the first coefficient of 
g which is not divisible by p. Let Gs be the first coefficient of h 
which is not divisible by p. The total coefficient of in 

gh is 

... + 

■Bi-l-2 C^A-2 + Bi^i Cfc-i + J5i (7 a 
(7a+i + Cfc+2 + '' ’ 

Since £i-i, J5i_2,..., So and (7&-i, Ck^ 2 , • • •, Cq are all divisible 
by Pf while BiCki^ not, the preceding sum is not divisible by p, 
contrary to the above. 


Exercises 

1. Gausses Lemma for the case m = 1 shows that if an equation os*’ + 
tti + • • • + ttr = 0 with integral coefficients, that of the highest power of 
X being unity, has a rational root, that root is an integer. 

2. An integral root of the equation in Ex. 1 is an exact divisor of the con¬ 
stant term a,. Hint: It divides all terms except the last. 

3. Show that — 2, a;® — 3a; + 1, and a;® — 7a; -f 7 are all irreducible in the 
field R of rational numbers. Hint: If such a cubic function were reducible in 
i2, it would be the product of a linear and a quadratic fimction with coeffi¬ 
cients in B. Apply the theorems in Exa. 1, 2. 
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82. IrreducibiKly of x^—A. 

Theorem 4. If p is a prime and if A is a quantity of a field F 
such that A is not the pth power of any quaniity of F, then — A is 
irreducible in F. 

An imaginary pth root of unity is given by 
p = cos 2 ir/p + i siu 2 t/p, 

since p*" = 1 by De Moivre’s theorem. By the same theorem, 
p'?^lifO<f<p. Hence 1, p, , p^-^ are all distinct and 
give all the pth roots of unity. Thus if r denotes one root of 
a;3> = A, all its roots are given by 

(3) r, pr, f^r, p^-'^r. 

Suppose that x^ — A has a factor /(») of degree t <p with 
coefficients in F, that of x* being unity. The product of the roots 
of f(x) =0 is, apart from sign, equal to the constant term. 
Hence p*r‘ belongs to F. By Theorem 2, there exist integers y 
and z such that ty — pz = 1. Hence F contains 

= p*" = p**' r A* = r'A*, 

where r' is one of the roots (3). Since A* is in F, also r' is in F. 
In other words, A is the pth power of a quantity r' otF, contrary 
to hypothesis. This contradiction proves that a:*' — A is irreducible 
in F. 

The theorem is illustrated by Ex. 3 of §81. 

83. Itreducibility of the cyclotomic equation. If p is a prime, 
the equation 

ajp — 1 

-— s xr -^ + + •••+*+1 = 0 

* — 1 
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has as its roots all the imaginary j^th roots of unity. As a general¬ 
ization, the equation 

— 1 

/(®) —7 =a;«(p-i)_^a;«C3>-2) _j-+1 = 0 = 

X* — 1 

has as its roots all the primitive p^’th roots of unity, viz., roots of 
ajP* 1 which do not satisfy a;* = 1. 

Let p be an arbitrarily chosen primitive p*th root of unity. 
Then p^^ ... , p^^" = 1 give all the t roots of x^ == 1. The 
remaining e tp --1 powers of p, with positive exponents 
ai, 02 , •.., less than tp and not divisible by p, are roots of 
f(x) =0 and give aU its roots. 

For example, if p* = 9, then p^, p«, p® give the roots of a;® == 1, 
while p, p^ p\ P®, p^, p® are the roots of a;® + a;® + 1 = 0 and 
are the primitive ninth roots of unity. 

Suppose that/(a;) is reducible in the field of rational numbers. 
Then by Gausses Lemma, / (a;) is the product of two polynomials 
<l>{x) and ^(a?), neither a constant, having integral coefficients, 
such that each has unity as the coefficient of the highest power of 
X. Smce/(1) = p, we have p = <^>(1) ^(1). Since p is a prime, 
one of these integers, say (1), has the value ± 1. At least one of 
the roots p“S ..., p"^® of /(a;) = 0 is a root of ^(a;) = 0; hence 

= 0. 

Here p is an arbitrarily chosen primitive p'th root of unity and 
hence is any one of the roots of / (a;) = 0, This proves that the 
function 

Fix) = <i>{x"^)4>{x^) ••• 

vanishes when x is replaced by any one of the roots of f(x) = 0 
(which has no multiple root). Hence 


Fix) ^fix)-q(x), 
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where the polynomial quotient q{x) has integral coefficients since 
the leading coefficient of the divisor f{x) is unity (or by use of 
Gauss’s Lemma). Taking a: = 1 and recalling that /(I) = p, we 
get 

[<^)(1)]« = (± 1)® = p* ff(l), 

where g(l) is an integer. Since this is impossible, we have the 
foUowing 

Theorems. The cyclotomic equation /(a;) = 0 whose roots are 
the primitive pHh roots of unity^ where p is a prirm^ is irreducible in 
the field R of rational numbers. 

Exercises 

1. For « 3, f(x) = a:® + a? + 1. For = 2®, f(x) = a;® + !• Verify 
that these quadratic functions are irreducible in B. 

2. For p* = 6, fix) =*a^-f-aj® + ic* + ic + l- Verify that it is irreducible 
in jB by discussing/(a:) s ^ ax + r) + bx + r"^). 



Chapter IX 

GEOUP OF AN EQUATION FOE A GIVEN FIELD 

84. Introduction. With a given algebraic equation we shall 
associate a certain group G of substitutions on its roots. The 
theory was initiated by E. Galois/ who was killed in a duel in 1832 
at the age of 21. In a later chapter we shall prove that the equa¬ 
tion is solvable by radicals if and only if G is a group of the kind 
called solvable and shall conclude that the general equation of 
degree five or more is not solvable by radicals. In another chapter 
we shall apply our theory to various questions ignored in elemen¬ 
tary geometry and, for example, prove that it is not possible to 
trisect every angle with ruler and compasses. 

85. Equal functions of the roots. Consider an equation 

(1) f{x) s a;” + Cl + • * • +Cn = 0 

whose coefficients are functions of certain variables Vj\ In § 79 
we explained the sense in which (1) has n roots which are func¬ 
tions of the Vj, 

Two polynomials 4> and ^ in the roots xi, ,, , ,Xn are called 
equal if they have the same value for all sets of values of ci,..., c„ 
employed there in defining the roots Xi. In case ^ and ^ are 
rational functions of the Xi, we impose the further restriction on the 
Ci that the denominators of 4> and ^ shall not be zero. 

We shall oaR ^ and ^ distinct if they are not equal. 

For example, consider the reciprocal quartic equation 

(2) + arc® + + 1 = 0, 

iJour. de Math., 11, 1846, 381-444; OEuvres, 1897; Manuscrits de Galois, 1908. 
For sources ozx Galois’s life, see Bull. Amer. Math. Soc., 4, 1897—8, 332; Scientific 
Monthly, 1921, 363-75. 
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where a and h are independent complex variables. Since the 
reciprocal of any root is also a root, we may choose the notations 
of the roots so that oJi % = 1, % = 1. Hence the two functions 

xi Xi and Xi Xi are equal. Thus the rational functions xi/xt and 
Xi/xz are equal. Again, for 


t = Xi-\- Xi — Xi — Xi, yi = XiXi + XiXt, 

we have 

^ - (H 

Hence the functions and — 46 + 4^i are equal. 

If the function ^ is derived from 4> by applying the substitution 
s on rci,..., and if ^ and <l> are equal in the foregoing sense, we 
shall say that <f> is unaltered by the substitution s. 

For the quartic (2), Xi Xz is unaltered by the two substitutions 
(13) (24) and (1324), but is altered by (13). 

86. Function of the n roots with n! values. In our study of an 
equation f{x) =0 of degree we may assume without real loss 
of generality that the n roots are distinct. Otherwise, f{x) and 
its derivative f{x) would have a greatest common divisor g{x) 
not a constant, and f(x)/g(x) = 0 has no multiple root and is 
satisfied by every root of /(rr) = 0. Hence let f(x) = 0 have the 
distinct roots a;i,..., a;„. We shah prove 

Theorem 1. There exist integers mi,..., ?w„ such that 

(3) miXi + mzX2+ -h w^na?n 

gives rise to n\ distinct functions F, when the n\ suhstitutions s on 
Xi jfXn are applied to Fi. 

The subscript 1 indicates that we denote the identity substitu¬ 
tion by 1 instead of I. 

For n = 2, we may evidently take Vi === Xi. The case — 3 
was discussed in Ex. 5, §67. 
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For any ti > 1, we are to prove that we can choose integers 
miy,,. ,mn which satisfy no one of the p == nl(nl — l )/2 equa¬ 
tions F, == V t, where s and t are distinct substitutions. If 5 is 
the identity and t = (a;i xss), the equation reduces to nii = m 2 , 
and this is the only one of our p equations which involves only 
mi and W 2 . It is not satisfied if mi = 0 , — 1. 

Next, let s and t leave Xa, • • • ,Xn unaltered. If s and t replace 
Xi, 0 C 2 , Xz by Xi, Xj, Xh and Xa, Xb, Xe, respectively, then F, = 
becomes 

mi Xi +m2X,* + m3 Xk — miXa + mzXb + m3 Xc* 

Equations of this type are the only ones of our p equations which 
involve only mi, m 2 , m 3 . Since the case in which rrh does not 
occur was treated before, we may assume that h 9 ^ c. For mi = 0, 
m 2 == 1 , the displayed equation reduces to 


Xh - x^ 

Hence we choose as m 3 any integer which is distinct from each of 
these fractions, finite in number. 

Next, let 5 and t leave xz, *,Xn unaltered, but replace Xa by 
different roots. For mi — 0, m 2 = 1, and for a chosen integral 
value of m 3 , F« = F i expresses m 4 as a fractional function of the 
roots. Choose as m 4 any integer which is distinct from each of 
these fractions. 

In the final step of the proof we employ substitutions s and t 
which alter Xn, and choose mn as an integer distinct from each of 
a finite number of fractional functions of the roots. 


Exercises 

1. Tlie roots of are iCi — — 1, xz ~ i, a?# =® — 

Show that % + ms rcs is six-valued if ms is distinct from 0, 1, ± z, 1 ± i, 
i (1 ± i), and hence if ms = — 1. Thus also a ?2 — a;i is six-vfdued. 
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Denote the six substitutions on xi, X 2 i xs by 

7, a « (12), b = (13), c = (23), d = (123), e = (132). 

Compute the following functions and verify that they are distinct: 

Fi = — ail = 1 + i, Fb s= a:2 — aJs = 2i, Fc = ajs — =® 1 — t, 

F«="Fi, Fd--76, 

2. If a;i,..., are independent variables, the fimction (3) has n\ distinct 
values if TWi,..., w» are any distinct constants. 

87. Galois resolvents. Let si,.,., Sr denote the substitutions 
on ici,..., Xm where r ^ n\ and si is the identity substitution. 
Let Fi be the r-valued function (3). If we apply s,* to Fi and then 
apply Sk to the resulting function Va^, we obtain F«j, where 
Si = Sj Sk- When k is fixed, but j takes the values 1, 2,..,, r, 
then I takes the same values in some new order. Hence s* merely 
permutes Fi,..., F»,. amongst themselves. The elementary 
symmetric functions of these F^s are therefore symmetric func¬ 
tions of , Xn, and hence are pol 3 niomials in ci,..., with 
integral coeflBicients. Let F be any field containing ci,..., Cn. 
Thus the coefficients of the polynomial defined by the expansion of 

(4) P(F) ^ (F - Fi) (F - F.,) ... (F - F.,) 

are quantities in F, 

If P(F) is reducible in P, let (?(F) be that factor irreducible 
in F for which (?(Fi) — 0. But if P(F) is irreducible in F, take 
G(y) to be P(F) itself. In either case, G(V) = 0 is an equation 
with the root Fi whose coefficients belong to F and which is 
irreducible in F; it is called a Galois resolvent of equation (1) for 
the field P. 

The corresponding resolvent of the equation in Ex. 1 of §86 
for the field of rational numbers is evidently 


(?(F) ^ (F - Fi)(F - Fc) - F2 -- 2F + 2 = 0; 
but for the field Rii), the resolvent is F — Fi =* 0. 



§87] 


GALOIS RESOLVENTS 


163 


- Theorem 2 . Let <l>{xi 3 ..., Xn) he any 'polynomial^ with coeffir 
dents in a field F, in the roots Xi of an equation with coefficients 
in F, Let s be any substitution on the roots and let it replace 4> by 
and Vi by V,, where Vi is the nl-^alued function (3) with ivr 
tegral coefifidents. Then 


(5) 




where \ is a polynomial with coefifidents in F, while P' is the deriva¬ 
tive of the polyrumial (4) whose coefifidents belong to P, whence 
P'CV,) 9 ^ 0. Thus (hs is the same rational function p(F«) of F» 
that is of Vi, 

Let r = n!, Z = Sr> Evidently s* permutes , thi, in the 

same manner that it permutes Fi,..., Fz- Hence the fractions in 


( 6 ) 


X(F) ^ <f>i 


P(7) 

V -Vi 


* ■ • + ^ 2 


P(F) 
F- Fz 


are merely permuted amongst themselves by any substitution on 
Xiy... j Xn* Replacing P(F) by the product (4), we see that 
X(F) becomes a polynomial in F whose coefficients, are integral 
rational symmetric functions of Xi,..., Xn with coefficients ia 
Fy and hence are equal to quantities m F. For F = F« each 
fraction in (6) becomes zero except that with the denominator 
F — Fa. If in the quotient of the product (4) by F — F, we 
take F = F#, we get the value of the derivative of (4) for F == Y,. 
Hence 

X(F.) - <A*P'(F,). 


In case the degree of the Galois resolvent G (V) = 0 is less than nl, we may 
express the numerator and denominator of = X(7i)/P'(Fi) as polyno¬ 
mials of lower degree in Vi by means of G {7i) = 0. Let r (Fi) be the resulting 
fractional expression for ^i. It does not now follow (as in the last part of the 
theorem) that we can apply any substitution s to the members of the relation 
01 = r(7i) and obtain a correct relation 0 « = r(F,). To obtain the latter 



164 


GROUP OF AN EQUATION 


[Oh. IX 


we would need to reduce the fraction (6) by means of Giy^) — 0, which 
however is not laiown to hold imless s is one of the special substitutions for 
which Vb is a root of G (F) = 0. For the remaining substitutions s, F, is a 
root of a factor, other than (F), of P (F) in (4). This discussion brings us 
to a critical point in our theory and indicates the care which must be taken in 
its development. 

88, The group of an equation for a given field. Let F be a 
field containing the coefldcients of an equation (1) whose roots 
aJi,,.., OJn are distinct. Let the roots of a Galois resolvent 
G{y) = 0 of degree g be 

(7) Vu Va, 76, 

in which the subscripts denote the substitutions on aii,..., »„ by 
which these F’s are derived from 7i. 

Thbohbm 3. These g siibstitvMons 

( 8 ) 

form a group G, edUed Uie group of the given equation (1) /or the 
field F. 

We are to prove that the product rs of any two equal or distinct 
substitutions (8) is one of those substitutions. Take Vr as the 
function ^ in Theorem 2. Then 


Vr 


X(7i) 


Vr. = (Vr). 


X(7.) 


where X(7) and P'(V) are pol 3 niomials in V with coefficients in 
F. Since Fr is a root of ff(F) = 0, the equation 



X(F) 

P'(V) 
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is satisfied when V = Vi. The product of the left member by the 
fifth power of P' (U) is a polynomial H (P) in V which vanishes for 
V = Vi and has quantities in P" as coefdcients. Hence (§80) the 
root Vt of the equation (r(F) = 0 irreducible in P* is a root of 
H (F) = 0. Since P'(F») 9 ^ 0, we may divide H (F.) by the ^ 
power of P'(F,); we get 

Hence F,, is one of the functions (7), so that rs is one of the 
substitutions (8). 

Exampmi. For the field B of rational numbers, a Galois resolvent ofiB* + 

+ » + 1 = 0 was seen in §87 to have the two roots Vi and Yc. Hence the 
group of this cubic for R is {1, (xi xs)}. But for the field B (i), a Galois re¬ 
solvent is F — Fi = 0 and the group is the identity. 


89. Characteristic properties A and B of the group G of a given 
equation for a given field F. 


A. If a rational function mih coefficients in F of the roots of an 
equation with coefficients in F remains unaltered in value by aU of 
the svbstitviions of the group G of the equation for F, it is equal to a 
quarUity in F. 


Let be a quotient of two poljmomials in the roots with 
coefficients in F. The coefficients of the equation are of course 
restricted to values for which 4 /9^0. We have formula (6) and 
similarly 


/i(F. 

P'(F.) 


where m(F) is a polynomial in F with coefficients in F. Since 
'll 5^ 0, ju(Fi) 9 ^ 0. If s is any substitution of 0, then 4/, 9 ^ 0. 
For, if ai(F) = 0 has the root F„ it has also the root Fi in com- 
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mon with the irreducible Galois resolvent (r(F) = 0 (§80). 
Hence the functions 


_ \{V.) 
i'. n(y,) 


(s = 1, a, , p) 


are defined for each substitution s of the group 0. 

Suppose that these g functions are equal, or in other words that 
is unaltered by aU substitutions of (?. Then 


The second member is a rational symmetric function with coeffi¬ 
cients in F of the roots (7) of G(V) = 0 and hence is equal to a 
rational function of its coefficients, which belong to F. Hence 
(p/rp is equal to a quantity in F. 

B. Conversely, if a raiional function of the roots with coefficients 
in F is egucA to a quantity in F, it remains unaltered in value by dll 
of the substitutions of 0. 

Let 4/4/ = r, where ?• is in F. Then \(y)/it(V) — r vanishes 
for V = 7i. Hence the equation X(7) — rpiiV) = 0 with 
coefficients in F* is satisfied by every root F, of the irreducible 
equation 0(7) = 0 (§80). Hence 


X(7.) 4>. 

m(7,) rP, 


(s 1, a,b,..., p). 


so that <p/pis unaltered by all the substitutions of G. 

Since an n 1-valued function 7i with coefficients in a given field 
F can be chosen in an infinitude of ways, there are infinitely many 
Galois resolvents 0(7) = 0. Our definition of the group 0 of 
the given equation for the field F was based upon a single such 
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resolvent, which is determined by the special Vi chosen. But 
different functions Pi lead to the same group. In fact, O is uniquely 
determined by the given equation and field F. This is a consequence 
of the following 

Thboebm 4. The group of a given equation for a given field F is 
uniqudy defined by properties A and B. 

First, suppose that = {1, r,..., to] is a group for which 
property A holds. Then the coeflBcients of 

^(P) = (P - Pi) (F _ 7,) ... (7 _ 7„), 

being symmetric functions of Pi, Pr,...» Pm, are unaltered by 
the substitutions of H and hence are equal to quantities in F. 
Since the equation 4> (P) = 0, with coefficients in F, admits one 
root Pi of the irreducible Galois resolvent G(P) =0, it adnoits 
all of the roots (7) of the latter ( §80). Hence 1, a,..., p occur 
among the substitutions of H, so that G is a subgroup of H. 

Second, suppose that K = {1, p,..., is a group for which 
property B holds. Then the Galois function <?(Pi), being equal 
to the number zero of F, remains unaltered in value by the sub¬ 
stitutions of K, so that 

0 « (?(Pi) = G(y,) -- Giyt). 

Hence Pp,.. -, F, occur among the roots (7) of (r(P) = 0. 
Thus Z is a subgroup of G. 

If H = J? is a group for which both properties A and B hold, 
the two results show that this group coincides with G. 

In view of its repeated application below, we state the result 
obtained in the second case as the 

Theorem 5. If every rational function of the roots with coefiir 
dents in F which is equal to a quantity in F is unaltered in value by 
every substitution of a group K, then K is a svbgroup of the group G 
for F of the equation. 
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90. Transitive and regular groups. A group of substitutions 
on xij.,, i Xn is called transiUve if it contains substitutions Si 
which replace xi by Xi for i — 1,,.., n. Then it contains a sub- 
stitutionsr^ s,*which replaces rcf by a;,-. A group which is not transi¬ 
tive is called intransitive. A transitive group of order n on n 
letters is called regular. 

The symmetric group on n letters and the group 

(9) (?4= {I, (12)(34), (13)(24), (14)(23)} 

on four letters are both transitive. The latter is regular, while the 
former is not if n > 2. But {/, (12), (34), (12) (34)} is an 
intransitive group. 

Theorem 6. The order of any transitive group O on n letters is 
divisible by n. 

AH those substitutions of 0 which leave xi unaltered form a 
subgroup H. Since G is transitive, it contains a substitution Si 
which replaces xi by Xi. Hence 

G = J? + Hsz + Hss + ■ • * + HSn, 

where every substitution of the set JHSi replaces xi by Xi and 
hence is distinct from each substitution of any set Hs,- having 
j 7 ^ i. Ne3ct, every substitution « of G is in one of the sets Hsi. 
For, if t replaces xi by a;*, then fo*-! leaves Xi unaltered and hence 
is a substitution h of H, whence t - hs^. Hence the order of 0 
is the product of the order by H by n. 

Theorem 7, If an equation is irredvmble in a field F, its group 
for F is transitive, and conversely. 

Let f{x) =0 be irreducible in F. Contrary to the theorem, 
suppose that its group G for F is intransitive and contains sub¬ 
stitutions replacing xi by xi, x^i ^ , x^, but none replacing xi by 

one of Xm+i, •.. ,Xn- Consider any substitution s of G and let it 
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replace Xi {i ^ m) by Xj. Since G contains a substitution t which 
replaces Xi by Xi, it contains U which replaces Xi by Xj. Hence 
j ^ rrij so that s permutes xi,.,. ,Xm amongst themselves, and 
leaves unaltered any sjonmetric function of them. Hence property 
A of §89 shows that the coeflScients of the polynomial in x given 
by the expansion of 

g{x) = {x - xi)(x - xz) • • • (a; - 

are quantities of F. Thus / (x) has the factor g(x) in F, contrary 
to its irreducibility in F. 

To prove the converse of the theorem, let G be transitive and 
suppose that/(x) is reducible in F. Thenf(x) has a factor g(x) 
of degree m <n with coefficients in P, such that g(x) =0 has 
the root rci. Since g{x\) is equal to the number zero of F, property 
B shows that it is unaltered in value by every substitution of (?. 
Since G is transitive, it contains a substitution replacing Xi by any 
chosen Xi (i S n). Hence = 0, in contradiction with 

m < n. 

Example. Find the group (? of re® — 7aj + 7 = 0 for the field It of rational 
numbers. 

Since the equation is irreducible in E (Ex. 3, §81), G is transitive. By 
§67, the square of 

( 10 ) 4^ = (xi- Xi) (xi - xs) (xi - xi) 

« is the discriminant 49 of our cubic equation, so that 4/ is equal to a number 
± 7 of R. Since any transposition replaces ^ by — ^ and hence alters ^ in 
value, it is not in by property B. The two results show that G = {7, (123), 
(132)}. 


Exercises 

1. For any field F containing the coefficients of an equation having a ra¬ 
tional root Xi and no multiple root, each substitution of its group leaves xi 
unaltered (by property B), If a root a;,* is not in F, the group contains a 
substitution which alters Xj. Hence prove that the group of (a; — 1) (a; -|- 1) 
(a: — 2) =0 for the field E of rational numbers is the identity. But the 
group of a;® — 1 = 0 for E is of order 2. 
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2. The group of jc® — 9a; + 9 — 0 for R is of order 3. Hint : The value of 
(10) is here ± 27. 

3. The group of a;® — 2 — 0 for R (w), where w is au imaginary cube root of 
unity, is of order 3. Use (10). 

4. The group of a;® — 2 = 0 for R is the symmetric group. For, if it were 
the alternating group, (10) would have a rational value. 

5. Find the group of a;* + 1 « 0 for R. 

Hints: By Ex. 1, §71, yi — XiXz -h xz X 4 , 2 / 2 , and yz are the distinct roots 
0, ± 2 of 2 /® — 4y = 0. Property B shows that each substitution s of G leaves 
2 / 1 , 2 / 2 , and yz formally unaltered. By Ex. 4, §73, s belongs to the group (9). 
Since a;^ + 1 is irreducible in R by Ex. 4, §80, G is transitive and hence of 
order ^ 4. Thus G = G 4 . 

6. The group of a;* + 1 = 0 for (i) is of order 2. 

Hint: The factors ± i show that the group is an intransitive subgroup 
of (9). 

7. If ao a;” + ai a;””i an = 0 has rational coefficients and is irre¬ 

ducible in Rj then 

(a) If there is a complex root of absolute value unity, the equation is a 
reciprocal equation of even degree. 

(b) If there is a root r + si, where r is rational, then n is even and the n 
roots may be paired so that the sum of the two of any pair is 2r, whence 
r = — aj (n ao). In particular, if r = 0, the equation involves only even 
powers of x. 

(c) If there is an imaginary root a hi such that a® -H 6® is rational, then 
n is even and the n roots may be paired so that the product of any two of a 
pair is a® + If p is a square root of a® -|- b®, then an/ao. Replacing 
X by p2/, we obtain a reciprocal equation in y. 

8. The group for Eofa;* + a;®-|-a;^+a;-|-l ^Oisa cyclic group of order 
4. It is irreducible in iS by Ex. 2, §83. 

9. Let G be the group for F of an equation irreducible in F whose n roots are 
rational functions fi{r) of one root r with coefficients in F for i = 0, 1, , . ., 
n — 1. Prove that G is a regular group and that fi\Si (r)] is a root. 

91* Rational functions belonging to a group. Consider a ra¬ 
tional function — , Xn)^ with coefficients in a field F, of 
the roots of an equation with coefficients in F. Let 0 be the 
group for F of the equation. Let 1, a, 6,..., fc be aU those sub¬ 
stitutions of O which leave ^ unaltered in value. Since ^ 
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has the value zero, we have ^6 — lAas = 0 by property Hence 
^ab = so that db is in the set 1, a, ,.. ,ky which therefore 
forms a group H, We shall say that ^ belongs to the subgroup 
H of Q. 

Example. Let Xi and cc ^ = oii be two roots of + 1 = 0. The only 
substitutions of its group (9) for B which leave xi^ unaltered are the identity 
and (12) (34). Hence they form the subgroup to which Xi^ belongs. 

Conversely, let H be any given subgroup of G. Let Vi be any 
n 1-valued function of the roots with coefficients in F, and let 
Vij Va,... pVk he the functions obtained from Vi by applying 
the substitutions of H. We may choose an integer r such that 

(r- 7i)(r- 7.) ••• (r - 7^) 

belongs to H. For, if s is any substitution of H, then s, as, ..,, ks 
are distinct and are in H, and hence form a permutation of 1, 
tty... ,k. Thus ^ is equal to 


(r-> 73)(r- 7a.) ••• (r - 7,a). 


But if s is a substitution of G which is not in H, then is not 
identical with ^ for all values of r, since 7. is different from 7i, 
Va, •.., 7ft. As in §86, we may choose an integer r such that 
every such ypa is distinct from Then ^ belongs to H. This 
proves 

Theoeem 8. Every rational function ^ with coefficients in F of 
the roots of an equation having the group 0 for the field F belongs to a 
definite subgroup of 0. There exist such functions rp belonging to any 
assigned subgroup of 0. 

We next prove the important supplementary 

Theorem 9. If a rational function \J/ with coefficients in F of the 
roots of an equation having the group 0 for F belongs to a subgroup 
H of index u under G, then the substitutions of 0 replace \[/ by exactly 
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u distinct functions, called ike conjugates to 4' under G. They are 
the roots of an eguaiion with coeffidenis in P which is irreducible 
in F. 

As in §76, let 

(11) G = H + Hg, + Hgz + ■■■+ Hgu. 

Let h be any substitution of H. Then 

= 4oii = 'l>i, 

so that 4' takes at most u values under G. If = 4's for j < i, 
then 4'p = where p = gtgj-^ is & substitution of G leaving ^ 
unaltered and hence is in Then gt = pgj, contrary to (11). 
Thus 4'h ' • 'i4>u are distinct. 

Any substitution s oi G merely permutes 4'h • •• f4^u amongst 
themselves. For, s is in and hence by (11) may be expressed 
in the form hgtc, where A is in IT; then 

(^»)« = 4'Bi* = 4^ist — 'i'Bj, = 4>h- 

Hence s leaves unaltered their elementary symmetrio functions 
which are, apart from sign, the coefficients of the polynomial 

(12) giy) = (y - 4'i)(y - 4'^) iy- 4'u). 

Therefore these coefficients are quantities in F by property A. 

Ji g(y) ia reducible in F, it has a factor t(y) with coefficients 
in F which is zero for y = ^i. By property B, tij/z) = 0,..., 
ti4'u) = 0. Thus t ^ g, 80 that p(y) is irreducible. 

ExiMPUB 1. The group (? of + a;* + ® + 1 = 0 for iJ is {J, (23)} if 
*1 = — 1. The conjugates to = *2 — »i under Q are ih — i' and 1^2 = 
— * 1 ; they are the roots of j/* — 2y + 2 == 0, which is irreducible in R. 
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Example 2. The group (? of + 1 = 0 for iZ (i) is {/, s), where s is a 
product of two transpositions. See Ex. 6, §90. Write asi = p, iCjs = ip^ ajs =* — p, 
a ;4 = — iPi where p = (1 + whence p® = i. Then s = (13) (24). 

The conjugates xi and to xi under G are the roots of 2/® — ^ = 0, which is 
irreducible in (i) since this held does not contain p. 


Theobem^ 10. Let 4> he a rational function with coefficients in 
F of the roots of an equation whose group for F is (?, If <f> remains 
unaltered by all those substitutions of Q which leave unaltered another 
rational function ^1/ of the roots with coefficients in F, then <t> is equal 
to a rational function of yp with coefficients in F, 

Let H be the subgroup of G of index u to which yp belongs. By 
means of (11), we obtain the u distinct conjugates ypi, •. •, pu to 
p ^ Pi imder 0, Since every substitution h oi H leaves p un¬ 
altered, each product hgi replaces phy pi ^ pg^. By the proof 
of Theorem 9, any substitution s of G replaces by a certain 
pk, and likewise pi by pk- Thus, if g(y) is defined by (12), the 
fractions in 


^ Pi y — ^2 V — \b.. / 


y - Pu‘ 


are merely permuted by s. The second member is equal to a 
polynomial X(y) each of whose coefficients is therefore unaltered 
by every substitution s of G and hence is equal to a quantity in F. 
Taking pi = p s^e y, we get (as in the proof of Theorem 2) 

P = X(^)/y'(^). 

Example. Let p be an wl-valued function Vi, so that the identity is the 
only substitution leaving p unaltered. Since it leaves every p unaltered, 
Theorem 10 states that every rational function p with coefficients in JP is 
expressible as a rational function of Vi with coefficients in F, This follows 
from the like property in §87 of the polynomial numerator and denominator 
of p, 

^The case in which the roots are independent variables is due to Lagrange. 
Nouv. M6m. Acad. Berlin, ann6es 1770-1, §§ 100-4; GSuvres, III, 374-88. 
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92. Effect on the group by.an adjunction to the field. Let G be 
the group of/(a:) = 0 for a field F containing the coefficients. The 
field F' = F (if-) composed of the rational functions of f with 
coefficients in F is said to be derived from F by adjoining ^ to F 
( §77). If the irreducible Galois resolvent GCF) = 0 for the field 
F remains irreducible in F', the group for F' of f(x) = 0 is evi¬ 
dently 0. But if it reduces in F', let 0'(.V) be that factor of 
(j(7) which has its coefficients in F', is irredueible in F', and 
vanishes for V = Vi. Then, if Fi, Fo,..., F* are the roots of 
G'(F) = 0, the group for F' of /(a:) = 0 is by definition (?' = 
{7, a,..., h}, which is a subgroup of (?. The former result may 
be combined with this since a group is included among its sub¬ 
groups. Hence we have 

TheobeucII. By an adjunction to the jUtld, the group of an 
equation is reduced to a subgroup. 

"ExAtiBiM 1. For the field R of rational numbers the group of + a? -b 
a; 4 -1 0 is G = |J, (23)} if xj denotes the root — 1; but in the enlarged 

field 12(f), the group is the identity (Example in §88). Note that the adjoined 
quantity i = x» belongs to the identity subgroup of Q. 

"Examssm 2. The group for 12 of x* + 1 «» 0 is ft, given by (9). In the 
notations of Ex. 2, §91, the group for 12 (f) is ft = (13) (24)). The latter 

is the subgroup of ft to which i=»i*=xj* = — —z^ belongs. 

These examples illustrate also the important 

Theobem 12. Let G be the group for F of an equation mfh coeffir 
dents in F. Let H he ffie sid)group to which hdongs a rationed funo- 
tion rp of the roots with, coefficients in F. By the adjunction of ^ to F, 
the group of die equation is reduced from G to H. 

It is to be proved that H has the characteristic properties A 
and B of the group of the equation for the field F' = F(^). 
Evidently any rational fimction ^ of the roots with coefficients in 
F' is equal to a rational function of the roots with coefficients 
in F. 
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First, let be unaltered by all of the substitutions of i 
Theorem 10, (which is unaltered by 2?) is a rational fn 
of ^ with coeflSicients in F. Hence is in F' — F (^), s 

property A holds for H and F\ 

Second, let 4 > be equal to a quantity r in F', namely a n 
function r(^) of ^ with coefficients in F. Then — r(v 
rational fimction of the roots with coefficients in F having th( 
zero, and hence is Unaltered by every substitution of (? a 
particular, by the substitutions of H, The latter leave 
imaltered and therefore also <t>i = This proves property 
H and F\ 


ExigRCISES 

1. By the adjunction of 2 the group Gj of a;* + 1 ~ 0 for 22 (i) is: 
to the identity group. 

2. By the adjunction of an imaginary cube root w of unity, the g 
ioT Rofa? — 2 = 0 is reduced to the cyclic group Cz (Exs. 3,4, §90). 
that « = xa/xi belongs to the group C®. By the further adjunction oi 
group is reduced to the identity. 

3. Find the group of a:* + x® + ic® + ® -f 1 ~ 0 for 22 (5*)* 

93, Group of the general equation. Let the coefficients c 
Cn of (1) be independent complex variables. Let Xij.., jXn 
roots in the sense of §79. 

Theoeem 13. The growp of the general equation for the . 
determined by its coefficients and any constants finite in nm 
the symmetric group. 

Suppose the group is H = {1, a,..., fc}, which does nc 
tain the substitution s. Apart from sign, the elementary sym 
functions of Fi, Va ,..., F* are coefficients of the Galois res 
and hence are equal to quantities in F. Hence if r is in F, 


(r^Vi)ir^Va) ••• (r-F,) 
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is equal to a quantity in F. As in §91, we may choose an integral 
value of r such that is distinct from But ^ is in F and hence 
is expressible rationally in terms of the symmetric functions 
Cl,..., c„ of the and is therefore unaltered by s. This con¬ 
tradiction proves the theorem. 


94. Further results. By an elaborate analysis, Hilbert ^ proved 
that if / is a polynomial in aji,.. ., Xr, ^i, ..., ia with integral 
coeflBicients which is irreducible in iS, it is always possible in an 
infinity of ways to assign integral values to the tj such that / 
becomes a function of the Xi irreducible in R. Let 0 (a;) =0 be an 
equation in x whose coeflScients are polynomials in ii,..., with 
integral coefiSicients. We can assign integral values to the such 
that the resulting equation in x has the same group for R as 
4 i{x) = 0. Applying this to the general equation, we conclude the 
existence of an infinitude of equations of degree n with integral 
coefficients whose group for R is the symmetric group. Similarly 
for the alternating group. The last two results have been proved 
more simply by various writers,^ 

A beginning^ has been made on the problem to find all equations 
of degree n with a prescribed group, with explicit results forn = 3 
and 71 = 4. 

An equation whose coefficients are rational functions of a com¬ 
plex variable k has a group which contains as an invariant sub¬ 
group the monodromie group composed of the substitutions on the 
roots which arise when k describes a closed path in the complex 


^ Joyr. fiir Math., 110, 1892, 104-29. In Ofversigt Finska Vetenskaps-Soo. F6i>- 
handlingar, 69, 1916—7, No. 12, WsisSJa proved that integral values may be 
assigned to the ij such -^at / decomposes into exactly as many irreducible factors 
as when the tj are variables. 

* Weber, Math. Papers Chicago Congress, 1896, 401-7; Algebra, ed. 2, I, 1898, 
653-^. Maillet, Jour, de Math., (5), 5,1899, 205-16. Bauer, Jour, fiir Math., 132, 
1907, 33-35; Math. Annalen, 64, 1907, 326-7. Schur, Jahresber. Deutsch. Math. 
Vereinigung, 29, 1920, 145-^50. FurtwSngler, Math. Annalen, 85, 1922, 34-40. 

*E. Noether, Math. Annalen, 78, 1918, 221-9. Seidelmann, ibid.t 230-3. Breuer, 
ibid., 86, 1922, 108-13 (cycHo sextics). 
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plane.'* Similarly there is a monodromie group of a linear differ¬ 
ential equation obtained by using linearly independent integrals 
(instead of roots as before). 

The group of a system of algebraic equations has been con¬ 
sidered.® 

The groups of a reciprocal quartic® and De Moivre’s^ solvable 
quintic® have been found. 

The group of the equation for the nine abscissas of the points of 
inflexion of a plane cubic curve has been treated fully by the 
author.® A more elaborate discussion is required for the 27 straight 
lines on a cubic surface or the 28 bitangents to a plane quartic 
curve.^® 

The Galois theory has been extended to ordinary linear differ¬ 
ential equations^ and to complete systems of linear partial differ¬ 
ential equations.^ 

^Miller, BHchfeldt, and Dickson, Finite Groups, 378-81. Hermite, Comptes 
Rendus Paris, 32,1851 (CEuvres, I, 275-80). Ftobenius, Jour, fur Math., 74,1872, 
254-72. Kneser, Irreducibilitat und Monodromiegruppe alg. GL, Thesis, Berlin, 
1885; Math. Annalen, 28,1886,125-32. Hurwitz,39,1891,23. Bianchi, Teoria 
dei Gmppi di Sostituzioni, 1897, 314-33. Baker, Annals of Math., 14, 1912-3, 
119-36. Ritt, Trans. Amer. Math. Soc., 24, 1922, 21-30. 

® Weisner, Bull. Amer. Math. Soc., 30,1924, 314-6 (concept due to KSnig, Math. 
Annalen, IS, 1881, 69-77). 

■Dickson and BSrger, Amer. Math. Monthly, 15, 1908, 71-78, 85-87. Miller, 
Blichfeldt, and Dickson, Finite Groups, 1916, 291. Criteria for irreducibility of a 
reciprocal equation of degree 2», Dickson, Bull. Amer. Math. Soc., 14, 1907—8, 
426-30. 

7Phil. Trans. London, 26. 1707, 2368-70. 

®B6rger, Amer. Math. Monthly, 15, 1908, 171-4. 

■MiUer, Blichfeldt, and Dickson, Finite Groups, 1916, 327-42; Annals of Math., 
16, 1914, 50-66. 

343-75. On pp. 375-7 is a brief account of the groups of further geometri¬ 
cal problems. 

Picard, Trait5 d’Analyse, III, Ch. 17. Loewy, H. Weber Festschrift, 1912,19S- 

227 . 

i^Vessiot, Annales Sc. ficole Normale Sup., (3), 21, 1904, 1-85. 



Chapter X 

EQUATIONS SOLVABLE BY RADICALS 

95. Historical note. Euler, the leading mathematician of the 
eighteenth century, believed that every algebraic equation is 
solvable by radicals. Beginning in 1799, Ruffini published several 
attempts to prove that the general equation of degree > 4 is not 
solvable by radicals, employing substitutions on the roots. His 
final and simplest proofs is essentially the same as that now known 
as WantzeFs simplification^ of AbeFs proof; but he gave only a 
trivial remark by way of proof of the auxiliary theorem, now known 
as AbeFs, that, if an equation is solvable by radicals, the roots can 
be given a form such that all the radicals occurring in their ex¬ 
pressions are rational functions of the roots [and roots of unity, 
with rational coefficients]. 

AbeFs® proof of the last theorem has two defects. As noted by 
Sir W. R. Hamilton,^ the nth root of is in general of order 
higher than pV». This first defect is best remedied by ignoriug 
AbeFs unnecessary classification of radicals. Second, AbeFs proof 
is vitiated by the occurrence of roots of unity and is remedied by 
adjoining to the field at the outset ah kth roots of unity, where k 
takes the values of the indices of all the radicals involved in the 
expression for a root y. 

But to give a sound proof of the impossibility of solving the 
general equation of degree > 4 along the hnes of Ruffini and Abel, 
we must introduce the ideas of groups of substitutions and adjunc- 


iRiflesdoni intomo alia soluzione delle equazioni algebraiche generali. Modena. 
1813. 

^Seiret, AIgSbre, H, eds. 4 or 5, 512. 

* Jour, fur Math., 1, 1826, 65-84; CEuvres competes, I, 1881, 66-94. 

^Trans. Royal Irish Acad., 18, 1839, 171-259. A very compUcated reconstruc¬ 
tion of AbeVs proof. 
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tions to fields, which served Galois in his far more general theory 
of the solvability of any equation. 

96. Solvability by radicals. The four rational operations are 
addition, subtraction, multiplication, and division. 

An algebraic equation is said to be solvable by radicals if all of 
its roots can be found by rational operations and extractions of a 
root, these operations being performed a finite number of times 
upon the coefficients of the equation or upon quantities obtained 
from them by those operations. Every cubic or quartic equation is 
solvable by radicals in view of Ch. VTI. One of our chief aims is 
to decide whether or not every equation of the fifth or higher 
degree is solvable by radicals. 

The above definition permits the use of the operation of finding 
one of the pth roots of a quantity previously determined, but not 
the use of the operations of finding aU its pth roots. The use of the 
latter operations would imply a knowledge of all the pth roots of 
unity, whereas one of our aims is to prove that, when p is a prime, 
they are all expressible in terms of radicals whose indices are < p. 

The solution of an equation solvable by radicals is usually ac¬ 
complished by the solution of a series of auxiliary equations the 
roots of any one of which can be found by rational operations and 
root extractions performed upon its coefficients and the coeffi¬ 
cients and roots of the preceding equations of the series. In order 
to focus our attention upon a particular equation of the series and 
to have a flexible phraseology, it will prove convenient to employ 
the following generalization of the foregoing definition of solvabil¬ 
ity by radicals. 

An equation with coefficients in a field jR(fci,..., km) shall be said 
to be solvable by radicals relatively to that field if all of its roots 
can be found by rational operations and extractions of a root per¬ 
formed upon fci,..., or upon quantities derived from them by 
those operations. 

For example, = 2^^ is evidently solvable relatively to i?(p), 
where p is a particular imaginary 13th root of unity. 
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97. Resolvent equations. Let 0 be the group for a field F of a 
given equation/(a;) = 0 with coefficients in A rational function 
^ of the roots with coefficients in F belongs to a certain subgroup 
H of index u under Q. By Theorem 9 of §91, i/- is a root of a re¬ 
solvent equation of degree u with coefficients in F. Assume that 
this equation is solvable by radicals relatively to F. By adjoining 
its root ^ to F, we obtain a field Fi = F^i}/) for which the group 
of / (a:) = 0 is now H (§92). Assume that successive such adjunc¬ 
tions lead to a field Fk for which the group of f(x) =0 is the 
identity. Then the roots all belong to i?’* by property A of §89. 

The assumptions just made will be shown to hold true when 
f(x) is a? + bx^ + cx + d, a b, c, d are independent variables, 
and F is the field ig(«, 6, c, d), where « = — | -f |(— 3)* is an 
imaginary cube root of unity. By §93, 0 is the symmetric group 
Ge on the roots xi, x^, Xs. To the cycKc subgroup C» belongs the 
alternating function 

d = (xi — xz) (xs - Xs) (xs — a:i), 

whose square is the discriminant A of the cubic. The resolvent 
equation S* = A is solvable by radicals relatively to F. Adjoining 
d = A * to jF, we obtain a field Fi = F(d) for which the group of 
our cubic equation is Cs. We employ the linear functions 4> yp 
of §66 and recall that, in the exercise foUowmg it, we verified 
that the substitution (132) of Cs replaces <l> by and yp by 
Thus all the substitutions of Cs leave <p^ and unaltered. The 
latter are equal to rational functions of d with coefficients in F 
(Theorem 10 of §91). The computations in §§66, 67 give 

4? = -f- 3d(— 3) *] = g, 4^ = i[B — 33(— 3)^] = h, 

<pp = — 3c, 

where B = 9bc - 2¥ - 27d. The resolvent equation = g, 
where g beloi^ to the field Fi = F{a), is evidently solvable by 
radicals relatively to Fi. In the enlarged field Fs = Fi(g*), 
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the group of the given cubic equation is the identity group to 
which 0 belongs. Hence its roots Xi aU belong to F 2 ; their expres¬ 
sions in terms of quantities in F 2 are given by formulas (5) of §66, 

98. Group of a resolvent equation. In our later discussion of 
the solvability by radicals of the resolvent equation defined at the 
beginning of §97, we shall need its group. Let ^ belong to the 
subgroup H of index u under G. In §91 we proved by means of 

(1) G = H -j- Hg2 + Hqz 4" • * * + Hqu 

that ^ is one of exactly u distinct conjugates under the group G: 

( 2 ) yp, ypazi • • - 

and that any substitution s of G replaces these by 

(3) 

which are merely the functions (2) rearranged. Hence to any 
substitution s of G on the letters Xi,... ,Xn there corresponds the 
substitution 



on the u letters (2). Similarly, to i of G corresponds 



where in the upper row of the final symbol we have adopted the 
order (3) of the letters (2). This is permissible since a substitu¬ 
tion is not changed if we rearrange the columns in its two-rowed 
notation. The product vt therefore replaces by and 
hence is equal to a substitution of type (4) which is derived from 
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(4) by replacing s by st. In other words, the set of all substitu¬ 
tions (4) is a group F. This group is transitive since s may be 
taken to be any gi. 

Theorem 1, To each substitution s of G corresponds a unique 
substitution o- defined by (4). All such substitutions (t form a iransi- 
tive group T. 

If to t corresponds r, we saw that to st corresponds ar. In other 
words, the correspondence between substitutions of G and those 
of r is preserved under multiplication. For this reason, G and T 
are said to be isomorphic. According as the order of G is equal to 
or exceeds the order of F, G is called simply or multiply isomorphic 
to F. 

Example. Let G be the alternating group on the independent variables 
xiy.,,, Zi. We have 

(?-a + (?4(234) +ft(243), 

(? 4 = {/, (12) (34), (13) (24), (14) (23)}. 

The indicated substitutions of order 3 replace ^ (zi ^ za) (zb — Xi) by 
^2 = (a:i — a;s) — 0 : 2 ) and ^3 = (iCi — 2 : 4 ) ( 0:2 — 2 : 3 ). Each substitution of 
G 4 leaves ^ 2 , and ^3 unaltered.^ Hence ^ belongs to ft and has only three 
conjugates under G, We have 

F “ {/, (^^2 Mi 

For, to each substitution of ft corresponds the identity of T. To all sub¬ 
stitutions of the set ft (234) therefore corresponds the same substitution 
(^^2 ^ 3 ) of r. Hence G is (4, 1 ) isomorphic to F. 


Exercises 

1. If G is the symmetric group and ^ is the alternating function, T = 

{/, (^2)}. 

2 , If G is the symmetric group on 211 ,, 2 : 4 , the group F on = xi X 2 + 
258 Xij viy Ui is the symmetric group of order 6. 

^This was shown in Ex. 4 of §73. See (22) of §71. 
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3. If (? is the symmetric group on xi, X 2 , xa, and if ^ is the six-valued function 
xi 4- &)» xa + wxa, T is simply isomorphic to G (cf. §67, Ex. 5). 

4. The group T is identical with that obtained by using another rational 

function ^ belonging to H and having its coefficients in F. Hints: By §91, 4> 
is a rational function r (^) of ^ with coefficients in F. By property B, ) 

is unaltered by every substitution t of G, whence ~ r(^i). Hence to any 
substitution on the u letters (2) corresponds the same substitution on the 
distinct letters ..., The converse is true since we may inter¬ 

change the idles of and 

5. Give another proof that T is determined by G and H alone, and is inde¬ 
pendent of If s is any substitution of G, prove that the products of the u 
sets in (1) by s on the right are the same sets rearranged, and that the result¬ 
ing substitution on the u sets is the same as (4) apart from the notation of 
the letters operated on. 

99. The special importance of the group T is due to 
Theobem 2. r is the group for F of the resolvent equation 

(5) giy) = (i/ - h)(.y - h) - (y ~ M ^0 

with coefficients in F. Here denotes yj/gf. 

To prove that T has the characteristic properties A, B of the 
group of (5) for F, note that any rational function B (^i,. • . , ^u) 
with coefficients in F is equal to a rational function r (xi,.. . , Xn) 
with coefficients in J?: 

(6) ... - r(xi ,..., x„) = 0. 

Since this (Merence is equal to the number zero of F, it is unaltered 
by every substitution s of the group ff on xi,..., x„. To s corre¬ 
sponds a definite substitution er of the group P on , fu. 

Hence 


( 7 ) 


RAh ,..., if'u) - r.(xi,. .., x„) = 0. 
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First, let R be unaltered by every substitution of T, so that 
R — Rg. for every <r in T. Comparing (6) with (7), we conclude 
that r ~ r* for every s in (?. Hence by property A for the group 
0, r belongs to’ F. This proves property A for the group F. 

Second, let R belong to F, By (6), r belongs to F. Hence by 
property B for the group G, r» — r for every s in G, Then by (6) 
and (7), R^ = R for every <r in F. This proves property B for the 
group F. 

Since F is transitive, equation (6) is irreducible in F. 

100. Invariant subgroup. Let ^|/ belong to the subgroup 
H — {Ai ^ I, h2} *. ^, hp] 

of G. Let h and s be any substitutions of H and 0 respectively. 
Then t = hs leaves unaltered, since 8“^ replaces by 
h leaves unaltered, and s replaces yj/ by Conversely, if t is 
any substitution of G which leaves ^a unaltered, so that yj/at = 
then 

= 'I'l- ste-i = h, 

where A is in H, whence t = hs. Hence ^a belongs to the 
subgroup 

Ai 5 = J, 5“^ h2S, s} 

of (?. This subgroup is denoted by Hs and is called the trans¬ 
form of H by s. Its substitution hi sis called the transform of 
^ibys (cf. §101). 

Theorem 3, Let yp belong to the subgroup H of index u under G. 
By means of (1), we obtain a complete set of conjugates to yp under 
G: 
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They belong to the respective groups 


Hj gz ^ Hgz) • • • ^ g^}" Hg^* 

The latter groups are said to form a complete set of conjugate 
subgroups of G. In case they are all identical, H is called an 
invariant^ (or self-conjugate) subgroup of G. In this important 
case, the substitution (4) is the identity if s is in ff, since s then 
leaves each ^g. unaltered; while any substitution s of G and the 
product hs, where A is in S', both correspond to the same sub¬ 
stitution (4). In other words, if we consider the sets H, 
etc., which by (1) make up all the substitutions of G, we see that 
all substitutions of H correspond to the identity of the group F, 
all of Hgz correspond to the same substitution of F, etc. To 
obtain T it therefore suffices to take the substitutions which 
correspond to I, ^ 2 ,..., gu- Since these replace ^ by w distinct 
functions, the resulting u substitutions of T are distinct. This 
proves 

Theorem 4. If H is an invariant subgroup of G of index u, the 
group r is a transitive group of order u on u letters and hence is a 
regular group. 

If w is a prime and tr is any substitution I of F, the order of 
0 - is w ( §76, Exb. 2, 3), so that the powers of a give all the sub¬ 
stitutions of F. This proves the 

Corollary. If H is an invariant subgroup of G of prime index 
u, then F is a regular cyclic group of order u. 

This corollary is illustrated by Ex. 1 and the example preceding 
it in §98. For each we shall prove in the next section that H is 
invariant in G. 


^ In other words, S is invariant in G if JET is transformed into itself by every sub¬ 
stitution of G. 
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101. Transforms of a substitution. There is a simple rule for 
finding the transform g~^ hg of A by gr without performing the 
indicated multiplications. Let A be a product rst • • • of cycles 
affecting different sets of letters. Evidently 


hg = g-^ rg •g-'-sg • • •. 


It remains to find the transform of a cycle r == (abc ... e) by 


-a 


6 c 
BC 


• e f 
■EF 


■ I' 


Then 


C---E F---L\ 
^ \ a b e ■■■e f •••I )’ 


g~^rg 


(ABC 
\BC B 


EF---L\ 

af---l)' 


Hence g~'^ rg = (ABC ... E) may be obtained by replacing 
each letter of the cycle r = (abc ... e) by the letter by which g 
replaces it, or expressed briefly by applying g within the cycle. 
Hence g-^ hg is obtained by applying g within the cycles of h. 
For example, 


(123)-i • (12) (34) • (123) = (23) (14). 

Since any substitution transforms a product of k transpositions 
into a product of k transpositions, and therefore an even sub¬ 
stitution into an even substitution, it follows that the alternating 
group on n letters is invariant in the symmetric group. The case 
= 2 shows that 


(?4= {I, (12) (34), (13) (24), (14) (23)} 
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is invariant under the S3mametric group 6u on four letters, since 
Oi contains every substitution (ob)(cd), where a, b, e, d form a 
permutation of 1, 2, 3, 4. 

A substitution is called irwariant in G if it is transformed into 
itself by every substitution of G. 


Exbkcises 

1. (?4 is an invariant subgroup of the alternating group on four letters and 
also of the groups Gg, Hg, Kg of §73. 

2. If Ris a subgroup of 0 of index 2, His invariant in G. Hint: <? = R + 
Hg = H + gH, whence the substitutionB of the set Hg form a permutation of 
those of gH. 

3. The only invariant substitutions of Qg in §73 are the identity I and 
a = (12) (34). Hence the only invariant subgroups of order < 4 of are 
the identity and {/, a). 

102. Simple and quotient groups. A group is called simple if 
it has no invariant subgroup other than itself and the identity 
group. A composite group is one which is not simple. A group 
whose order is a prime is evidently simple. 

Let i? be an invariant subgroup of index u of a group G. By 
§91 there exists a function f belonging to the subgroup H. Then 
the group F defined in §98 is of .order u (Theorem 4 of §100); it 
is called the quotient group oiGhy H and denoted by G/H.. Its 
order is the quotient u of the order of G by that of H. 

For example, if Gt and Qz are the symmetric and alternating 
groups on three letters, the alternating function v^i belongs to Gt 
and it takes a second value ^2 = — under Ga. Then Qg/Gt is 
the group F = {J, 

A proper subgroup of G’ is a subgroup distinct from G. 

In particular, let .IT be a maximcd invariant proper subgroup of 
0, so that H is not contained in a larger invariant proper subgroup 
of G. 
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Theokem 5. The quotient group T — G/H is simple. 

For, suppose that T has an invariant subgroup A distinct from 
r and the identity group. Write A = {ai == /, a 2 ,..., 
where a* corresponds to aU the substitutions of a set Hai form¬ 
ing a component of G in the notation (1), Write 

D = Hoi “h Hdi “h •••-[" Hudf Ui = /. 

Since A is a group, ai a,* = a*; whence Hai • Ha^ = Huk, 
so that D is a group. To every substitution g of G corresponds a 
substitution y of F. Since H is invariant in G, all the substitu¬ 
tions of the set g~'^ Hai g = H • g~^ at g correspond to y"^ ai 7 , 
which is a substitution of the invariant subgroup A of F. 
But the substitutions of Ha^ alone correspond to a/. Hence 
g'^^Hai g = Ha,-, so that B is invariant in G. Since A is distinct 
from r and the identity, D is distinct from G and H. Hence H 
is contained in the larger invariant proper subgroup D of (?, 
contrary to hypothesis. 

103. Series and factors of composition, solvable groups. Any 
group G contains the identity group (?i, which is invariant in (?. 
Since there is only a finite number of sets of substitutions in a 
group G of finite order, G has a finite number of subgroups. 
These two remarks show that G has a maximal invariant proper 
subgroup H. Similarly, H has a maximal invariant proper sub¬ 
group K. Such a series of groups G, H, X,.., M, Gi, terminating 
with Gi, is called a series of composition of G. If H is of index u 
under G, K of index v under H,..., and if w is the index of Gi 
under M (i.e., the order of M)y the positive integers w, ?w 

are called the factors of composition of G (relative^ to the series 
G,H,...,Gi). 

^ But in fact the same for all series of composition of G, a result due to Jordan, 
Trait6 des Substitutions, 1870, 42. It is a corollary to the theorem of Haider 
(Math, Annalen, 34, 1889, 37; simplified by G. L. Brown, Bull. Amer. Math. Soc., 
1, 1894-5, 232-4) that the quotient groups G/H, H/K^... ^ M bxq identical in 
some order with those of any series of composition of G. Cf. Miller, Blichfeldt, and 
Dickson, Theory and Applications of Finite Groups, 1916,175-6. 
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A group is called solvable if its factors of composition are aU 
primes, otherwise insolvabh. 

Exercises 

1. If (t 6 and Ga are the symmetric and alternating groups on 3 letters, the 
only series of composition of is Gsj Gs, ft, and Ge is a solvable group. 

2. A cycHc group of prime order is a solvable group. 

3. The symmetric group on 4 letters has the series of composition fti, 
fta = alternating group, ft (in Ex., §98), ft, ft, there being three choices 
for ft. Hence Gu is a solvable group. 

4. If Cm is generated by a = (xiXz • • * rcu), the only proper subgroups are 

ft = {I, a®, a\ o«, a8, a^o}, ft = {7, a^, a®}, ft = {/, aS a^], ft « {/, a«}, 

and ft = {/}. Every subgroup of a commutative group is invariant. The 
only series of composition of Cu are 

Oi2j ft, ft, ftj Cuf Oe, ft, ft 5 Oi2j ft, ft, ft. 

The factors of composition are 2, 2, 3; 2, 3, 2; 3, 2, 2, respectively. 

5. If G has a subgroup H of index 2, H a subgroup K of index 2, etc., to the 
identity group ft, then G has the series of composition G, Hj K,..., ft. 

6. Find the seven series of composition of ft in §73. 

7. Prove that the three series of composition in Ex. 3 are the only ones. 

104. Theorem 6. The solution of an equation having the group 
G for the field F can be reduced to the solution of a series of equations 
each having a simple regular group for the field obtained by adjoining 
to F a root of each of the earlier equations of the series. In particular^ 
if G is a solvable group, each auxiliary equation has a regular cyclic 
group of prime order. 

Let G, H, K,... ,Gi he a, series of composition of G, and u, 
V,, be the corresponding factors of composition. Construct 
a rational function yp of the roots Xi with coeflicients in F such that 
yp belongs to the subgroup H oi G oi index u ( §91). Then ^ is a 
root of an equation of degree u with coeflS-cients in F whose group 
r for F is simply isomorphic with the simple quotient group 
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Q/H (§§99, 102). After the adjimction of the root ^ to F, the 
group of the given equation becomes H iot the field F{^) by 
§92. 

Construct a rational function x of the Xi with coefficients in 
Fiyp) such that x belongs to the subgroup K oi H oi index v. 
Then x is a root of an equation of degree v with coefficients in 
F{^) whose group for that field is simply isomorphic with the 
simple group HfK. By the adjunction of x> the group of the 
given equation becomes K for the field x)- 

The final such step is the adjunction of a function belonging to 
<?i. The resulting field contains each Xi. 


105. Equations with a cyclic group. The last result is sup¬ 
plemented by the following 

Theorem 7. An equation, whose group 6 for a fidd F containing 
an imaginary plh root p of unity is a regular cydie group of prime 
order p, is solvable hy radicals rdativdy to F. 

Let the roots be xo, Xi,. .., Xp-i and let G be generated by the 
substitution s = (xoXi • • • We see that s replaces 

(8) ri = Xo + p*a!i + p^^xt + -f- Xp-i 


by p~* ri and hence leaves a,- = unaltered. Since the coeffi¬ 
cients of Vi belong to F and it is un;^tered by all the substitutions 
s* of G, Vi is equal to a quantity in by property A of §89. Evi¬ 
dently — ro is a coefficient of the given equation. The p linear 
equations (8) for i = 0, 1 ,..., p — 1 can be solved for the p 
unknowns Xp-i by the method employed in '§66 for the 

case p — Z. Multiply (8) by where 0 < p, and sum for 

i = 0,1,..., p — 1. In the resulting relation, the coefficient of 
®,- is 1 + • • • -b 1 = p, while, for k j, the coefficient of is 


p-i 

t-O 



(< = - j), 
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since t is not divisible by p. Hence ^ 


p-i 

pxi = ro + 2 P“’^ O' = 0» 1> • • • > P — 1)- 

i-1 

106. Cyclotomic equations. To perfect the last theorem, it 
remains to prove that we can find by root extractions the imagi¬ 
nary pthroots of unity, where p is an odd prime. They are the suc¬ 
cessive powers of one of them, say p. For 2 > = 6, they may be 
arranged in the order 


p, p^, p®* = p\ p^® = p*, 

so that each is the square of its predecessor. 

Since the discussion does not become more complicated, we shall 
treat the generalization from p to p* and hence obtain results 
needed also in our later study of regular polygons. In §83 we 
saw that the primitive ninth roots of unity are 

A a2 a4 rfi a1® = a7 

Pj P } P } P } P P i P 

each of which is the square of its predecessor, and that the e = 
p* — p«“^ primitive p®th roots of unity are those powers of one of 
them, say p, whose exponents are the positive integers less than 
p® and not divisible by p. The above facts concerning fifth and 
ninth roots of unity may be generalized as follows: It is shown in 
the theory of numbers that there exists an integer g such that, 
when 1, g, ..., are divided by p®, the positive remainders 

< p® coincide in some order -with aU the positive integers < p® 

which are not divisible by p, and hence give the mentioned ex¬ 
ponents of p. In other words, the primitive p* roots of unity are 

3/1 “ p, 2 J 2 “ P^, 3/3 “ p^,..., 3/^ — p*^ • 

1 Lagrange, Traits resolution equations, 1808, Note XXXI, §16. Implicitly in 
Nouv. Mem. Acad. Berlin, ami6e 1770, §71; CEuvres, XII, 333. 
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Hence 


(9) a%j = Xi”, xa — xt”,... ,Xe = «! = a;,". 

The final relation follows from Euler’s theorem that g* is of the 
form 1 kp‘, where k is an integer (as shown in the theory of 
numbers). Compare the above cases p‘ = 5, p‘ = 9. 

We proved in §83 that the equation having these roots Xi,, 
X, is irreducible in the field R of rational numbers. Hence its group 
G for jB is transitive. Let 

_ / aJi Xt Xa-' -XeX 
VSo Xt Xo - • -xi) 


be any substitution of G. By property B of §89, it follows from 
(9) that 

Xi = Xa”, Xe = Xi", . . . , Xa = Xl". 

But ®o+i = Xa", etc., by (9). Hence 


Xj) — Xa+1) Xe — • • • 3 Xa — 

provided the new symbol a:«+i be replaced by Xi. If two integers 
u and V differ by a multiple of e, we write u = v (mod e). Hence 

ftso + l, csh + l = a4-2,... (mod e), 

^^/xi Xi Xa •■•Xk ■••X, \ 

V Xa Xa-fl Xa+2 ' * * • • • Xa.\.a..‘l • 

in which x,+j is to be replaced by a:,-. Hence 

s = t = {xiXiXa • • • X,). 

Since G is transitive it contains all the powers of t. 

Thbobem 8. If p is an odd prime, the group for the field of ror 
tional numbers of the cydotomic eguoMoh whose roots are the primitive 
p‘th roots of unity is a regular ay die group of order e = p‘ — p’-K 

This theorem was verified for p*=5 in Ex. 8, §90. 
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107. SuflBlcient condition for solvability by radicals. An equa¬ 
tion having a solvable group for the field determined by the 
coeflS.cients is solvable by radicals. We shall prove the more 
general 

Theorem 9. An algebraic equation having a solvable group for 
any field F containing the coefficients is solvable by radicals relatively 
to F. 

Assume for the moment that the theorem holds for every equa¬ 
tion having for F a regular cyclic group of prime order. We shall 
then prove that any equation f{x) — 0 whose group for F has 
prime factors of composition u, v,... is solvable by^ radicals 
relatively to F, As in §104, there is a series of equations =0, 
m(x) = 0,... of prime degrees u, z;,..., the solution of which 
is equivalent to the solution of f(x) = 0. As there proved, the 
group for F of n{\l/) = 0 is a regular cyclic group of prime order 
u, wMch by our assumption is solvable by radicals relatively to F. 
The coefficients of m(x) = 0 belong to the field jPi == F(^) and 
its group for Fi is a regular cyclic group of prime order v( §104); 
by our assumption it is solvable by radicals relatively to Fi. It 
is therefore solvable by radicals relatively to F, since the same was 
just proved for — 0. Similarly, all of our auxiliary equations 
are solvable by radicals relatively to F, Since the roots of f{x) — 0 
are expressible rationally in terms of quantities in F and roots 
i/', X,... of these auxiliary equations ( §104), we see that/(a;) = 0 
is solvable by radicals relatively to F. 

It remains to justify the assumption made above that any 
equation C{x) — 0 having a regular cyclic group G of prime order 
p for a field F is solvable by radicals of indices ^ p relatively to 
F, This is evidently true when p = 2. To proceed by induction 
on Pj suppose that every equation having a regular cyclic group of 
order Qj where g is a prime < p, for any field D is solvable by 
radicals of indices ^ q relatively to D. As in the preceding para¬ 
graph, this implies that the equation for the imaginary pth roots 
of unity is solvable by radicals of indices < p (i.e., relatively to 
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the field R of rational numbers). In fact, its group for jB is a regu¬ 
lar cyclic group of order p — 1 (§106), whose factors of com¬ 
position are the prime factors of p — 1, and these primes are < p. 

Adjoin to F an imaginary pth root p of unity. The group of 
C{x) = 0 for the field F(p) is either the initial cyclic group G of 
prime order p or the identity group ( §92). In the latter case, the 
roots are in F(p) and can be found from quantities in by ration¬ 
al operations and extractions of roots of indices < p, since p was 
seen to be derivable from rational numbers by those operations, 
so that C{x) = 0 is solvable by radicals of indices < p relatively 
to F. In the former case, C(x) =0 is solvable ( §105) by radicals 
of indices ^ p relatively to ^’(p) and hence, as before, relatively 
to F. 

Hence the induction is complete and the theorem is proved. 

CoEOLLARY. If p is an odd prime, the equation for the p 1 
imaginary pth roots of unity is solvable by radicals of indices < p. 

Examplb 1. If hj c, d are independexit variables and <a is an imaginary cube 
root of unity, the group of + 5a;® + ca? 4* d = 0 for the field B («, b,c,d) 
is the symmetric group on the three roots (§93). Since it is a solvable group 
by Ex. 1, §103, the equation is solvable by radicals (of. §97). Note that after 
the adjunction of the square root of the discriminant, the equation has a 
cyclic group of prime order 3. 

Examplb 2. Consider the quartic equation whose coefficients are inde¬ 
pendent variables. For the field F determined by its coefficients, the group is 
the symmetric group (?24 on the roots. It is a solvable group by Ex. 3, §103. 
To the invariant subgroup Gi 2 j composed of the even substitutions, belongs 
the product P of the differences of the roots. The square of P is the dis¬ 
criminant a of the quartic equation. The group reduces to G 12 for the field 
Fi = P(P). The function yi = XiXn -hicsa;* is unaltered by ft =* {/, 
(12) (34), (13) (24), (14) (23)}, while the remaining eight substitutions of 
Gii replace 2/1 by yj or ys. These three y^s are by §91 the roots of a cubic equa¬ 
tion with coefficients in Pi (the resolvent cubic in §71). Since ft is invariant in 
<Ti 2 , the group of this cubic for the field P (P) is Cru/ft, which is a cyclic group 
of order 3. By the adjunction of yi to Pi, the group of the quartic reduces to 
ft. The function xi X 2 is unaltered by ft = {/, (12) (34)}, while the remain¬ 
ing substitutions of ft replace it by xs Xi, The adjimction of a root of ^ 
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2/1 2 + xi X 2 xzXi == 0 reduces G* to the invariant subgroup G 2 . Finally, the 
adjunction of Xi — Xi whose square belongs to Fi ( 2 / 1 , xi X 2 ) reduces G 2 to the 
identity group. The enlarged field contains xi, ..., Xa, which have been 
found by the extraction of three square roots and one cube root. 


Exercises 

1. Instead of 2/1 we may employ in Example 2 

ii^ = 4i/i + 6® — 4c, = oji + 0:2 — ics — Xi, 

where the notations of §71 are used. Evidently the transformation y » 
i (i — b® 4 “ 4 c) replaces the former resolvent cubic by one in t whose roots 
are the squares of h, t 2 - Xi Xs — X 2 — Xij is = xi + — af 2 — 0 : 3 . From 

these three linear relations and — b = 2) Xi, determine the roots Xi. 

2. Modify Ex. 2 by adjoining the cube root o) of unity to the initial field 

and employing instead of yi the function = 2/1 + + w® ys. Write 

^ = 2/1 + 2/2 + «2/s* Applying to the resolvent cubic (19) in §71 the results 

in §66 with x^s changed to y% show that 


^3 __ ^3 -= 3 ct>®)P, = c® — 3bd 4* 12e, 

,^3 + ^3 2c 8 - 9bcd - 72ce + 27d® + 27b® e, 

which are essentially the invariants I and J of §12. Hence ^ is found as a cube 
root and then p is known. Prom the values of and 2/1 + 2/2 + ^3 == c, 
we obtain 2 / 1 , 2 / 2 , 2/3 by solving three linear equations (cf. §66). For the field 
Pi obtained by adjoining P and </», the group of the quartic equation is G 4 . 
If also the root <1 of Zi® = 4yi -|- b® — 4c is adjoined, the group reduces to 
G 2 — [Ii (12) (34)}. Finally, if also a root t 2 of — 42/2 + 4cis adjoined, 

the group reduces to the identity and the x^s may be found as in Ex. 1. Note 
that h k h — 4bc — b® — Sd. The elegance of this solution is due to its 
employment of binomial resolvents only and to its introduction of the invari¬ 
ants I and J. 

108. Jordan’s 1 theorem on the mutual adjunction of roots. 

Theorem 10. Let the group Gi for a field F of an algebraic 
equation fi{x) = 0 fee reduced to Hx by the adjunction of all 
of the roots of a second equation f 2 (as) = 0, and let the group G 2 for F 
of the second equation be reduced to H 2 by the adjunction of all of the 

^ Trait6 des Substitutions, 1870, 268-9. 
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roots of the first equation. Then Hi and H 2 are invariant subgroups of 
Gi and (? 2 , respectively^ of equal indices, and^ the quotient groups 
0\/Hi and G 2 /H 2 are simply isomorphic. 

By §91, there exists a rational function gi with coeflSicients in F of 
the roots Xi,.., ,Xn of the first equation such that gi belongs to 
the subgroup Hi of Oi, Since the adjunction of the roots yi,,, 
y^ of the second equation reduces Qi to Hi, property A of Hi 
( §89) shows that gi is a quantity of the enlarged field, whence 

(10) gi{xi, . .., ajn) = Ai(yi,..., 2/m), 

where hi is a rational function with coefficients in F, Let gi,.., gh 
denote the distinct functions obtained from gi by applying the 
substitutions of Gi, Then Hi is of index k under Gi ( §91). Let 
hi,... ,hi denote the distinct functions obtained from hi by apply¬ 
ing the substitutions of G 2 . The k functions gi are the roots of an 
equation irreducible in F; likewise for the I functions hi. Since 
these two irreducible equations have a root (10) in common, they 
are identical by §80. Hence k = I and the g^s coincide in some 
order with the h^s. 

If Si is a substitution of Gi which replaces gi by gi, then Si trans¬ 
forms the group Hi to which gi belongs into the group to which gi 
belongs ( §100), whose order is that of Hi, Since gt is equal to a 
certain A / it belongs to the field F' F(yi,,,,, y„) and hence is 
unaltered by the substitutions of the group Hi oi fi{x) = 0 for 
that field JP' (property B of §89). Hence the group to which gi 
belongs contains aU the substitutions of Hi and, being of the same 
order as Hi, coincides with Hi, This proves that Hi is invariant in 
Gi, The group for F of the irreducible equation satisfied by gi is 
therefore the quotient group Gi/Hi ( §102). 

iThis supplement and the proof here employed are due to HSlder, Math. An- 
nalen, 34, 1889, 47. For generalizations and related results, see Kneser, ihid.^ 30, 
1887,179-202; shorter proof of theorem on p. 195 in Jour, filr Math., 106,1890, 61. 
Landsberg, ibid., 132, 1907, l'-20. Bucoa, Rendiconti Circolo Mat. Palermo, 14, 
1900, 122-6. Loewy, Math. Zeitsohrift, 15, 1922, 261-73. 
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Let K be the subgroup of Gt to which hi (j/i ,.. .ym) belongs. It 
is of index k since hi is a root of an equation of degree I = k irre* 
ducible in F. By the adjunction of hi to F and hence of gi by (10), 
the group (Ja of fi(x) = 0 for F is reduced to if (§92). If not 
merely gi{xi, ..., »«), but all of the Xi themselves be adjoined, the 
group Oi reduces to a subgroup of K. Hence is contained in K, 
This proves the preliminary result: If the group of /i(a:) = 0 
reduces to a subgroup of index k on adjoining all of the roots of 
fi{x) =0, then the group of ft(x)=0 reduces to a subgroup of index 
ki(ki ^ k) on adjoining all of the roots of /i(a:) = 0. 

Interchanging fi and /a in the preceding statement, we obtain 
the following result: If the group of /a (a:) = 0 reduces to a sub¬ 
group of index ki on adjoining all the roots of fi(x) — 0, then the 
group of fi(x) = 0 reduces to a subgroup of index ^(Jba ^ ki) on 
adjoining all of the roots of /a (a:) = 0. Since the hypothesis for 
the second result is identical with the conclusion for the first 
result, we conclude that h = k, and fci > ft ^ fti, whence fti = ft. 
Since Ha is contained in K, and Ha is of index fti under Gs and K 
is of index ft under Ga, it follows that Ha and K are identical. 

For the same reason that Hi is invariant in Gi, it follows that Ha 
is invariant in Ga. Since hi belongs to the subgroup H = Ha of 
Ga, the group for F of the irreducible equation satisfied by hi is the 
quotient group Ga/Ha. Since this equation was seen to be identical 
with that satisfied by gi and since the group of the latter equation 
for F was seen to be Gi/Hi, the groups Ga/Ha and Gi/Hi differ 
only in the notations employed for the letters on which they oper¬ 
ate, and hence are simply isomorphic. 

109. Galois’s theorem on adjunction. 

Thboeem 11. By the adjundion of any one root of an equation 
/a (a:) = 0 whose group for a field F is a regtdar cyclic group of 
prime order p, the group for F of Uie equation fiix) = 0 eithet is 
not reduced at aU or dse is reduced to an irwariant subgroup of 
index p. 
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This is a corollary to Jordan^s theorem, since if we adjoin one 
root Xi of f 2 (x) = 0 we reaUy adjoin all of its roots. For, the 
identity is the oiJy substitution of the group for F of f 2 (x) - 0 
which leaves Xi unaltered in value, whence each root is a rational 
fiinction of Xi with coefficients in F (Theorem 10 of §91). 

110 . Galois’s criterion for solvability by radicals. An algebraic 
equation is solvable by radicals if and only if its group for the field 
determined by the coejfficients is a solvable group. 

It is occasionally useful to have the generalization: 

Theorem 12. An equation is solvable by radicals relatively to 
any field F containing the coefifidents if and only if its group for F is 
a solvable group. 

That the equation is solvable when the group is a solvable group 
was proved in §107. We shall now prove the converse. Assume 
therefore that the roots Xij... yXn can be derived by rational 
operations and root extractions from quantities in F or from quan¬ 
tities obtained from them by those operations. The index of each 
root extraction may be assumed to be a prime since a pgth root is 
a pth root of a gth root. Such a two-story radical as well as the 
underneath radical are both listed in making a list of all the 
radicals 

(11) , a*’/'* 

which occur in the expressions for Xi ,Here ai is in F, oa 
is in the field derived from F by adjoining the first radical, 03 
is in the field derived by adjoining the first two radicals, etc. Also 
pi,... ,Vh are primes. 

By the corollary in §107, the equation for the imaginaiy pth 
roots of unity (where p is a prime) is solvable by radicals of indices 
< p: Those roots may therefore be expressed rationally in terms 
of radicals forming a chain of type (11), where now F is the field of' 
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rational numbers. Write down the radicals of the chain for 2 ? = 3 
and foUow them by the radicals of the chain for — 5 , and so on 
for the primes in order up to and including the maximum of 
Vh • • • > After the last of these radicals write the radicals (11) 
in order. We obtain a chain of radicals 

( 12 ) . .., 

having the following properties: First, gi,..., ga are primes. 
Second, 6 i is in the field F, while 62 , 63 , .. . are in the fields derived 
from F by adjoining the first radical, the first two radicals, -.., 
respectively. Third, a;i,..,, are rational functions with coeffi¬ 
cients in F of these radicals, since they were such functions of the 
included radicals (11). Fourth, after the adjunction to F of the 
first r — 1 radicals ( 12 ), we obtain a field containing all the 
imaginary grth roots of unity. 

Lemma. For afield containing A and an imaginary pth root p of 
unity f where p is an odd prime, the group of ^ A is the identity 
group if one root belongs to the field, hut is a regular cyclic group of 
order p if no root belongs to the field. 

For, if one root is r, the remaining roots are pr, fPr,,., ^ r. 
All of the latter belong to the field containing p if r does, and the 
group is then the identity. Next, let no root belong to the field, 
so that A is not the pth power of a quantity of the field. Then 
njp — A is irreducible (§82), so that the group is transitive and 
hence of order ^ p. The notation for the roots Xi may be chosen 
so that 

X 2 “ P^lj ^8 ~ • • * > ~ P^p—l} “ P^p* 

Consider any substitution 

^ / Xl X2 Xs • * • Xp 
^ \Xa Xh aJe * • • Xi 
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bf the group. By property B of §89, we have 


iCfi ““ pXa ^0 *” P^b • • • 


As in §106, s is the power a — 1 of (xiXiXa ••• Xp), which there¬ 
fore generates the group. 

Let G be the group for F of the given equation having the roots 
Xi,..., Xn. By the Lemma and Galois’s theorem ( §109), the 
adjunction of the first radical (12) to F either does not affect Q 
or reduces it to an invariant subgroup of index qi. The subse¬ 
quent adjunction of the second radical (12) either does not affect 
the resulting group or reduces it to an invariant subgroup of index 
gj. After the adjunction seriatim of all the radicals (12) we have 
a field containing Xi,... ,Xn, for which the group is now the 
identity Gi. Since we passed from G to (?i through a series of 
groups each an invariant subgroup of its predecessor of prime 
index, we have a series of composition of G such that each factor 
of composition is a prime. Hence G is a solvable group. 

111. General equation of degree n>4 not solvable by radicals. 
By §93, the group of the general equation of degree n (i.e., one 
whose coefficients are independent complex variables) is the 
symmetric group G for the field determined by its coefficients and 
any constants finite in number. By §101, G has as an invariant 
subgroup the alternating group, wMch is composed of all the even 
substitutions on n letters. 

Thbobem 13. If n > 4, the aUemating group A on n letters is 
simple. 

For, suppose A has an invariant subgroup H distinct from the 
identity group Gi. Of the substitutions of H different from the 
identity J, consider those whi(^ affect the least number of letters. 
Any such substitution h involves the same number of letters in its 
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various cycles. For, if two cycles involve a and b letters, where 
a < b, is not I and affects fewer letters than h does, since it 
leaves the a letters unaltered. 

Having A, the invariant subgroup H oi A contains the trans¬ 
form of h by any even substitution e and hence contains p = 
he • }r\ 

(i) If h contains more than three letters in any cycle, say 

h = (1234 the choice e = (234) gives p = (124), 

which affects fewer letters than A does. 

(ii) If A contains at least two cycles of three letters, say A = 
(123) (456) ..., the choice e — (125) gives p = (15243), which 
affects fewer letters than A does. 

(iii) If A contains at least two cycles of two letters, say A = 
(12) (34) ..., the choice e - (123) gives p = (13) (24), which 
affects fewer letters than A does unless A = (12) (34). For the 
latter A, the choice e = (125) gives p — (125), which affects fewer 
letters than A does. 

Hence the substitutions ^ I oiH which affect the least number 
of letters are all of type (a6c), since (a&) is odd and hence is not 
inH. 

Next, any cycle of three letters can be transformed into any 
other such cycle by an even substitution, so that H contains every 
cycle of three letters. For, (123) is transformed into (a6c) by 
both of the substitutions 


/I 2 3 4 5 
\a b c d e 




1 2 3 4 5 
a b c e d 


n 

I 


). 


where the dots denote the same letters in the two substitutions, 
one of which is even and the other odd. 

Finally, every even substitution e is expressible as a product of 
cycles of three letters. For, a — fe - -- tzh, where each U is a 
transposition. If ii and 4 are identical, they cancel. If they have 
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just one letter a in common, fe = (ab) (oc) = (abc). Ji they 
have no letter in common, 

tit 2 = (ab){cd) (dbc){cad). 

Similarly for UU,..., fejb-i Uk* Hence H ^ A. 

If n > 4, the symmetric group G on n letters therefore has the 
series of composition (?, A, Oi, with the factors of composition 2 
and I • n!. Since the latter is not a prime, G is not a solvable 
group. This proves 

Theorem 14, 7/ n > 4, the general equation of degree n is not 
solvable by radicals. Moreover, its roots cannot be found by rational 
operations and root extractions performed upon the coefficients and 
any constants, finite in number, or upon quantities derived from them 
by those operations. 


Exercises 

1 . If 71 > 4, there is no series of composition of the symmetric group G on 
n letters other than G, A, Gi. For, if G has an invariant subgroup H other 
than Gi, the proof of Theorem 13 shows that either R » A or else H contains 
a transposition (at). In the latter case H contains every transposition and 
coincides with G, 

2 . If the general quintic equation is not solvable by radicals, the same is 
true of the general equation of degree ti > 5. Hint: Equate to zero the coeflGl- 
dentsof for j <n ^ 5, 

3. Hence give another proof of Theorem 14 by amplifying the following 

new proof of the simplidty of the alternating group (r on 5 letters. Let s 
replace 1, 2, 3, 4, 6 by any assigned permutation a, h, c, d, e of them. If an 
invariant subgroup H of G contains (123), it contains its transform (a6c) 
by an even s. With (12) (34), H contains its transform (a5) (cd) by s or 
(12 )s, one of which is even. With t = (12345), H contains its transform 
k — (pbcde) by « if s is even. But if s is odd, I =* (3245) transforms = 
(13524) into so that Is is even and transforms into k. Hence H is of order 
^ 1 + 20a; + 15y + 242, where x, y, z take only the values 0 or 1. Also, 

A is a divisor < 60 of 60, If h has the factor 5, then 2 = 1 and h = 25, which 
is not a divisor of 60. Hence A is a divisor of i 60 and hence A = 1. 
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4. If for a field F the group of an equation \vith the roots xi^.,. jXn is 
generated by « — {tcix^ • Xn)y the equation is irreducible and xi — r(aJi), 
where r is a rational function with coefficients in F, and also xz = r ( 0 : 2 ),. .., 
Xn = r(a;»_i), xx = r(»n). Conversely, an irreducible equation in F whose 
roots satisfy these n relations has a regular cyclic group for F. 

5. By supplementing Ex. 3, show that every substitution of the alterhatmg 
grout) on 5 letters is conjugate to 7, (123), (12) (34), t = (12346), or 

112. Solvable quintics. The Jacobi-Cayley resolvent sextic of 
a quintic has been found recently by the author^ by direct, ele¬ 
mentary methods. The quintic is solvable by radicals if and only 
if the sextic has a rational root. He easily deduced the covariant 
resolvents of Perrin and McClintock. 

Berwick^ computed a resolvent sextic whose coefldcients are 
invariants of the corresponding quintic form. 

^BuU. Amer. Math. Soc., 31, 1925, 515-23. For this and other resolvents, see 
Brioschi, Math. Annalen, 13, 1878, 109-60. 

2Proc. London Math. Soc., (2), 14, 1916, 301-7. Result quoted in Mathews, 
Algebraic Equations, 1915, 55. 



Chapter XI 

CONSTRUCTIONS WITH RULER AND COMPASSES 

113. The problems to be solved.. We shall prove that it is not 
possible, by the methods of Euclidean geometry, to trisect all angles, 
or to construct a regular polygon of 7 or 9 sides. For these problems 
the ancient Greeks sought in vain for constructions with ruler and 
compasses. We shall specify all the values of n for which a regular 
polygon of n sides can be so constructed. One such value is 17. 
The fact that a regular polygon of 17 sides can be constructed 
with ruler and compasses was not suspected during the twenty 
centuries from Euclid to Gauss. 


114. Analjrtic criterion for constractibility with ruler and com¬ 
passes. The first step in our consideration of a problem proposed 
for construction consists in formulating the problem analytically. 
In the ancient Delian problem of the duplication of a cube, we 
take as the unit of length a side of the given cube, and seek the 
length a; of a side of another cube whose volume is double that of 
the given cube, whence = 2. 

For the problem to trisect a given angle A, we employ the 
trigonometric identity 


( 1 ) 


cos A = 4 cos® — 3 cos 


A 

3‘ 


Multiply each term by 2 and write x for 2 cos Thus 
(2) X? — Zx --2 cos A = 0. 


Given A we can construct (with ruler and compasses) a line whose 
length is the positive value of ± cos A; it is the adjacent leg of a 
right triangle, with hypotenuse of unit length, formed by dropping 
a perpendicular from a point in one side of A to the other side, 

204 
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produced if necessary. If it were possible to trisect angle A, i.e., 
construct angle iA, we could as before construct a line whose 
length is ± cos fA, and hence a line whose length is double that 
of the former, viz., ± x. 

In each of these problems we have been led to the question of 
the possibility of constructing a line whose length is ± x, where x 
is a real root of a cubic equation with known coefficients. 

Criteeion.^ If X is a real root of a given equatioUi it is possible to 
construct with ruler and compasses a line of length ±, x if and only if 
X can be derived by rational operations and extractions of real square 
roots performed a finite number of times upon the coefficients or upon 
numbers obtained from them by those operations. 

First, if X can be derived in the manner indicated in the criterion, 
we can construct a line of length ± x. For, a rational function of 
given numbers is obtained from them by additions, subtractions, 
multiplications, and divisions. The construction of the sum or 
difference of two segments of straight lines is obvious. To con¬ 
struct segments whose lengths are the product p = a6 and quo¬ 
tient q = a/b of the lengths a and b of two given segments, use 
similar triangles two of whose pairs of corresponding sides are 
1, 6 and a, p, and 1, q and b, a, respectively. Finally, a segment s 
of length n^, where n is positive, may be constructed by drawing 
a semicircle on a diameter composed of two segments of lengths 
1 and n, and drawing a perpendicular s to the diameter at the 
point which separates the two segments. 

Second, suppose that we can construct a line segment of length 
± X, when we are given segments representing the coefficients of 
the equation for x, whose leading coefficient is unity. We may 
choose any line as the first constructed line OX and choose any 
point 0 on it as the point from which we draw another line or 
about which as center we draw a circle. Take 0 as the origin and 

^Descartes, La g6om6trie, Leyde, 1638, livre premier (German transl, by Sohle» 
singer, Berlin, 1894; English transL, Open Court Publishing Company). 
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OX as the a;-axis of a system of rectangular coordinates. The fuiv 
ther lines and circles constructed are determined by points which, 
with the exception of 0, are located as the intersections of straight 
lines and circles. The coordinates of the intersection of two inter¬ 
secting straight lines are evidently rational functions of the 
coefficients of their equations. If the straight line y mx+ b 
intersects the circle 


(x - py + {y- qy r®, 

the coordinates of the points of intersection are found by eliminat¬ 
ing 2/, solving the resulting quadratic equation for x, and inserting 
the roots x into y = mx + 6; hence the coordinates are obtained 
from 6, p, q, r by rational operations and the extraction of a 
real square root. EinaUy, two intersecting circles cross at the 
intersections of one of them with their common chord, so that this 
case reduces to the preceding. Since each such circle or line other 
than OX was determined by two points which (with the exception 
of 0) were found as intersections of earlier lines and circles, the 
coefficients of their equations are obtained by rational operations 
and extractions of real square roots performed either upon the 
coefficients of the given equation or upon numbers obtained from 
them by those operations. 

115. Tiisection of an angle. To prove that it is not possible to 
trisect angle 120° with ruler and compasses, note that cos 120° = 
— so that (2) becomes 

(3) x^-Zx + l== 0 . 

By Ex. 3 of §81, this equation is irreducible in the field R of 
rational numbers. Since its discriminant is 81, the alternating 
function has a rational value + 9. Hence the group for R is the 
alternating group O 3 of order 3. By the adjunction of a square 
root, the group is either not changed or else is reduced to a sub- 
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group of index 2 ( §109). The second alternative is excluded. 
If a root of the cubic equation could be found by rational oper¬ 
ations and extractions of square roots performed upon its co¬ 
efficients or upon numbers obtained from them by those opera¬ 
tions, the adjunction of that root to R would not reduce the group 
Gzi whereas the adjunction of any root reduces Gz to the identity 
group. The above criterion therefore proves that it is not possible 
to construct a line whose length is the root 2 cos 40° of the cubic 
equation, i.e., to construct angle 40°. Hence it is impossible to 
trisect angle 120° with ruler and compasses.^ 

Exercises 

1 . A regular polygon of 9 sides cannot be constructed with ruler and 
compasses. 

2 . The duplication of a cube is impossible with ruler and compasses. 
Hints: The group iovRoiz^ = 2 is the symmetric G^. The adjimction of one 
root reduces it to a group of index 3. 

3. It is impossible to trisect an angle whose cosine is J, J, i, }, or p/g if 
p and q (g > 1) are integers without a common factor and q is not divisible 
by a cube. 

4. It is possible to trisect an angle whose cosine is (4a8 — 3a6®)/5*, if the 
integer a is numerically less than the integer h. Hint: Take cos iA = a/h in 
( 1 ). 

6 . It is impossible to construct lines representing the lengths of the edges 
of a rectangular parallelopiped having a diagonal of length 5, surface area 24, 
and volume 1, 2, 3, or 5. 

6 . It is impossible to construct a straight line representing the distance x 
from the circular base of a hemisphere to the parallel plane which bisects the 
hemisphere (problem of Archimedes). Hint: The equation for a; is (3). 

116. Regular polygons. The construction of a regular ri-gou by 
ruler and compasses is equivalent to that of angle 2ir/n and hence 
of a line of length ± cos 2T/n. The irreducible equation satisfied 

^The author gave an elementary proof without group theory in Amer. Math. 
Monthly, 21,1914, 259-62; Mathematics Teacher, 14,1921, 217-23. 
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by the latter number is much more difficult to form and treat than 
that having the root 

(4) r = cos 27r/n + i sin 2w/n, 
which is an nth root of unity. We have 

1 /r = cos 2 T/n — i sin 27r/n, r + 1 /r — 2 cos 27r/n. 

Hence if r can be expressed in terms of i and real square roots, 
cos 2 T/n can be expressed in terms of real square roots. The con¬ 
verse follows from (4) and the fact that the sine can be found from 
the cosine by a real square root. Hence a regular n-gon can be 
constructed with ruler and compasses if and only if the imaginary 
nth root (4) of unity can be found by extraction of square roots, 
all except the last one of which is real. 

First, let n = p*, where p is an odd prime. Then r is a root of an 
equation of degree e = (p — 1) whose group for the domain 
R of rational numbers is a regular cyclic group G of order e ( §106). 
Since the identity is the only substitution of (? which leaves a root 
unaltered, the adjunction of that root reduces G to the identity 
group ffi. If a regular p*-gon can be constructed with ruler and 
compasses, the adjunction of the root r is equivalent to that of 
several square roots, the adjunction of each of which causes either 
no reduction in (? or a reduction to a subgroup of index 2 ( §109). 
Since the adjunction of r reduces 0 to (?i, its order a is a power of 
2. If s > 1, 6 has the odd factor p and is not a power of 2. If 
5 = 1, then e = p — 1; if also e = 2®^ where a is odd, then p = 
2“ * -b 1 has the factor 2* + 1 and is not a prime. Hence if a regular 
p*~gon can be constructed^ where p is an odd 'primes then s = 1 and 
pis of the form 

(5) 2^ + 1. 

For fc = 0,1, 2,3,4, the corresponding numbers (6) are 3, 5,17, 
257, 65537, all of which are primes. But for k = 5, Q, 7, 8, 9, 11, 
12, C5) is known to be not a prime. 
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Conversely, if p is a prime such that p — 1 = 2*, a regular 
p-gon can be constructed with ruler and compasses. For, the 
group for R of the equation for r in (4) is a regular cyclic group 
Q of order 2*. Hence 0 has a series of composition for which each 
index is 2. 

By §104, the solution of the equation for r reduces to the solu¬ 
tion of a series of quadratic equations.^ 

Next, let n = a6, where a and & are integers without a common 
factor >1. If a regular o-gon and a regular 6-gon can be con¬ 
structed with ruler and compasses, the same is true of a regular 
?2rgon. For, any multiples of the angles 27r/a and 2Tc/h can then 
be constructed and hence also the difference of these multiples. 
By §78, there exist positive integers c and d such that ca — = 1. 

Hence we can construct the angle 


27r , 2ir 

c • -- d -- — (ca — do) 

0 a ao 


ab 


and therefore a regular a6-gon. Conversely, from the latter we 
obtain a regular a-gon by using the vertices numbered 1, 6 + 1, 
2b + 1, , (a — 1) b -b 1. Hence if n - p* q* • • •, where 

p, g,... are distinct primes, a regular n-gon can be constructed if 
and only if a regular p*-gon, a regular g^^-gon,... can all be con¬ 
structed. Since a regular 2^-gon can be constructed by repeated 
bisections of 180®, our results may be combined into the 

Theokbm. a. TBgvlcLT polygon of n sides con be constvucted with 
ruler and compasses if and ordy if n = 2* pi p 2 .. • j where pi, P 2 i • • • 
are distinct primes of the form (6). 

Iln Dickson’s First Course in the Theory of Equations, 1922, 41-44, these 
quadratic equations are obtained when p = 17 and a conslaructioii of a regular 
17-gon is deduced. The same results are given also in his article m Monographs on 
Modern Mathematics, Longmans, Green, & Co., 1911, wHch gives also an expoa- 
tion of the method of Gauss for finding the series of equations whose solution leads 
to f for any prime p. 
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REDUCTION OF EQUATIONS TO NORMAL FORMS 

117. Tschimhaus tiansfonaations. Let Xi,... ,Xn be the 
roots of 

(1) fix) s + Cl a:""^ + • • • + c» = 0. 

To find the equation having the roots = *< + v, we have merely 

to apply the transformation y = x v; we get 

f(y — v) = + (Ci - 4- • • • = 0. 

The coefficient of is zero if v = ci/n. Hence we can remove 
the second coefficient by rational operations. 

To find the equation whose roots are the squares of those of 

(1) , transpose all terms of odd degrees, square each new member, 
and then replace a? by y. 

These transformations y^^x + v, y = a^, as well as y = 1/x 
(which is used to find an equation whose roots are the reciprocals 
of the roots of a given equation), are all cases of a Tschimhaus 
transfomaation 

(2) y = gix)/hix), 

where g and h are polynomials such that h (x) vanishes for no root 
of f(x) = 0. 

Exescises 

1 . The equation whose roots are the squares of the roots 1, 2, — 2 of 
4aj + 4«0is^ — + 24^/ — 16 = 0. 

210 
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2. Knd the equation whose roots are the cubes of those of a:' + as + 6 » 0. 
Hint: 

(— “ a® a;® -f 6* + 2ahx(ax + &). 

Heplace ox + b by — a:® and then write y for x*. 

3. To find the equation whose roots are the fcth powers of the roots of (1), 
employ the Arth roots ri,..., r* of unity and 

h li 

P — ILf(yri) « n [( 2 /rf - Xi) • • ♦ (yn — x»)] 

»-i t’-i 


» n 

= 'Jl[(yri—Xj) ••• (jTi - Xi)] = (— n (s/* — »/*). 

/-I /-I 

Hence the expansion of the initial product P contains only powers of y whose 
exponents are multiples of h. Write y^ = z, 

4. Transformation (2) is only apparently more general than y =* P(x), 
where P is a polynomial. Hint: By §78, there exist polynomials s(x) and 
t{x) for which 1 s s(x)/(x) + t{x)k(x). Take as x a root x* of/(x) = 0, 

5. Let 4i{y) =» 0 be the equation having the roots yi =P{xi) for 
1 = 1 ,..., n. K 2 / 1 ,..., 2 /n are distinct, then Xi ^ r ( 2 / 1 ),..., x» * r(yn), 
where r (s) is a rational function of z whose coefficients are derived rationally 
from those of P (x) and/(x). Hence if we solve 4>(y) =0 and find that its 
roots are distinct, we can find the roots of / = 0 rationally in terms of the yi. 

Hint: The greatest common divisor di(x) oi fix) and P(x) — 2/4 is found 
by rational operations, and is of the first degree in x (otherwise it would vanish 
for two distinct roots x< and xy of / = 0, and then P (x) — yi would vanish for 
X » Xf, whence yj — yi). 

6. If </> ( 2 /) » 0 in Ex. 5 has a root yi of multiplicity mij the determination 
of the m< roots of /(x) » 0 for which P (x) has the value yi requires the solu¬ 
tion of an equation di (x) » 0 of degree un. 

Hint: For each of the mt roots, both /(x) and P(x) — yi vanish, and 
therefore also di (x), which is a linear combination of them ( §78). 

118. Principal equations. We can alwa3rs transform (1) into a 
principal equation of degree n in j/ in which the coefficients of 
and are both zero. By §117, we may assume that ci = 0. 
Let ^ 0. Write Sk for the sum of the id;h powers of the roots 
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Xi,, z„. By Newton’s identities, Si — 0, S2 = — 2c2. Employ 
the transformation y = + ax + h. Write yn for the value of 

y when x — Xk. Then 2 y* = S2 + «& will be zero if we take 
6 = — Si / n . Next, 

y* = »* + 2aa? + (a® + 2b)x^ + 2dbx + 6®, 

whence 

2 = S 4 + 2oS 8 + (o® + 26 )s 2 + n&® = 0 

is a quadratic equation for a in which the coefficient 82 of a® is not 
zero, and hence has two roots. Choosing either root, we may 
compute similarly ^y*ky .. and then compute by New¬ 

ton’s identities the coefficients of the equation having the roots 
VU • • • j yn* 


Theorem* 1 . By a linear transformation, or by a quadratic 
Tschimhaus transformation whose coeffidewts involve a single square 
root, any equation can be reduced to a principal equation. 

119. The Bring-Jexrard normal form. We shall transform any 
principal equation into one in which the coefficients of y"~*, 
y“"®, y”"® are aU zero. By hypothesis, si = 82 = 0 in (1). If also 
S3 = 0, then Cl = C2 = C3 = 0 by Newton’s identities, and no 
transformation is necessary. Next, let Ss 5 ^ 0. The functions 

(3) g{x) — a? + 03? — Ss/n, h(x) = a? + b3? — Si/n 

evidently have the property S 9 = 0, 2 A = 0, where 2 9 
denotes ^g(,Xk). The conditions for 2 *9 ~ '^xh = 0 are 
84 08* = 0, 83 -f 683 = 0, and may be satisfied by choice of 

a and b. The function 

(4) ^(x) = eg + wh 


1 Tschimhaus, Acta Eraditorum, 2,1683,204-7. 
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will have the property S ^ = 0 if 

(5) + 2zw + 

This homogeneous quadratic equation in z and w can evidently be 
satisfied by values of z and w not both zero. Then x and ^ have 
the properties 

(6) 23 ^ = Oj 23 S ^ Z) 0* 

Write y =^ux + v4^. Then Z) y = 0, 2^ 2 /^ = 0 for every u 
and V* The condition for 23 2/® ” 0 is a cubic equation in w : v and 
hence can be satisfied by values of u and v not both zero. In the 
resulting equation in the coefficients of 2/””S 2/"'“^ 2 /’‘‘"®are 
therefore all zero. 

We saw that any equation F{z) = 0 can be reduced to a prin¬ 
cipal equation f{x)=0 by a transformation x ^ P{z). To 
f{x) — 0 we just applied a transformation y - Q(x). Hence the 
final equation in y may be derived from F (z) == 0 by the single 
transformation 2/ = QI-PC^)]* Id- other words, the product of two 
Tschimhaus transformations is a Tschimhaus transformation. 

This completes the proof of 

Theoeem 2. By means of a Tschimhaus transformation whose 
coefficients involve a cube root and three square roots, any equation of 
degree n can be transformed into an equation of degree n in yin which 
the coeffiwients of 2/"“® are all zero. 

This theorem is usually ascribed to Jerrard.^ But for = 5 it 
was obtained much earlier by E. S. Bring.® We therefore call 

iMaUi. Researches, Brxstca?aixd London, 1834, Ft. 11. Report by W. R. Hamil¬ 
ton, British Assoc. Report, 6, 1837, 295-348. 

*Meletemata . . . transformationem aequationum algebraicanm, Lund, 1786. 
Substance reproduced in Quar. Jour. Math., 6,1864, 45—47; Aichiv. Math. Fhys., 
41, 1864, 105-112; Annali di Mat., 6, 1864, 33-42. 
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(7) j/® + dy + e = 0 

the Bring-Jerrard normal form of quintic equations. 


Exebcises 

1 . Any equation of degree 3 or 4 can be solved by radicals. 

2 . Every quintic equation can be reduced to tbe form 2 /® + + e = 0 by 

a Tsohimhaus transformation whose coefficients involve a square root and a 
root of a quartic equation. Hint: If Ci ~ C 2 = 0 , then S 4 + 4 c 4 = 0 ; hence 
Tise U ^ = 0 . 

3. By choice of i in y = te, reduce (7) and the quintic in Ex. 2 to normal 
forms involving a single parameter. 

4. Derive (7) geometrically. Hints: Let the transformation y — Uo + oi a: 
+ 02 X® -h Os + 04 replace (1) by 2 /® + di 2 /* + (fe 2 /* + • • - = 0 . Then 
dj is a homogeneous polynomial in oo,..., U 4 of degree j. By means of di = 0, 
express oq as a linear homogeneous function of Oi,..., 04 . Interpret the 
latter as the four homogeneous coordinates of a point P in space (the lengths 
of the perpendiculars from P to the faces of a fixed tetrahedron of reference 
being proportional to oi,..., 04 )- Eliminate ao from ^2 “ 0 and ds - 0 . The 
resulting quadratic equation represents a quadric surface, which contains 
real or imaginary straight lines; usually there are two such hues through each 
point of the surface and then they are found by means of a quadratic equation 
in one unknown. The intersections of a chosen one of these lines with the cubic 
surface obtained from dg = 0 are found by solving a cubic equation in one 
unknown. 

120 . The Brioschi normal form of (|umtic equations. In the 
preceding Ex. 3, we saw that any quintic equation can be reduced 
to a normal form involving a single parameter by means of trans¬ 
formations involving both square roots and a cube root. By 
means of a transformation involving only square roots, we shall 
now reduce any sufidciently general quintic equation to a remark¬ 
able form which also involves a single parameter and which plays 
a central rdle in the theory of quintic equations. 

We start with a principal quintic fix) = 0, having therefore 

“0, §2 = 0. We assume that Ss ^ 0, and employ the poly- 
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nomial (4) having the properties (6). We shall first prove the 
existence of constants p, g,..., ^ (p and q not both zero) such 
that 

(8) pi/^ + 2qx\l/ + rx^ — ayp — bx + t ^ 0 [mod /(a;)], 

which means that the left member is the product of f{x) by a 
polynomial in x. 

First, let « = 1, = 0 in (4), whence ^ is the cubic g. From 

^ we eliminate and by means of /(x) =0, then eliminate x* 
by means of x^ and finally x® by means of we get (8) with 

p = 1. 

Second, let w We may take w? = 1 and write 
^ = X* + aix® + dx® + c. 

For a certaia quadratic function Q of x, we have 

C = xyl/ — /(x) — z\l/ - kx^ + Q, k - d — z?. 

If fc = 0, this gives (8) with p = 0, 2^ = 1. If fc 0, we elim¬ 
inate X® and higher powers of x from ^|/^ by means of/(x) = 0, then 
x^ by means of and finally x® by means of C, and obtain (8) 
withp = 1, 

Inserting the five roots of /(x) =0 into (8), summing, and 
applying (6), we see that i = 0. 

If X aiid <l> are any linear functions of x and (6) imply 

(9) E X = 0, 2 X* = 0, E = 0, E E X<^ = 0, 

while (8) implies a similar relation 

flO) PX® + 2q<l>x + - ox - = 0 [mod/(»)]. 
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We employ the identity 


( 11 ) 

where 


mF = (dx + e 4 >y — c(ax + b 4 >y, 
F - px* + 2q4>x + r4?. 


( 12 ) 


m-pV^- 2qdb + ra\ c = f - pr, d - bp - aq, 
e = bq — ar. 


We exclude the special quintics for which c = 0 and those for 
which both a and b are zero. The linflar functions x and ^ of 
X and )ff may be chosen so that ax + is identical with ux + 
where u and v have any prescribed values not both zero. We 
can choose them so that ux-\-v^ vanishes for no root of/(a:) = 0. 
Hence it is permissible to apply the transformation 


(13) 


y = 

®X + b4> 


f (®) = 0* Denote the resulting equation by 
(14) + (iiy* + <hy^ + oav^ + (hy + ot = 0. 

We determine its coefficients as follows: By (13), 


y + c» = ^x + e^ + c*(ax + i!><^) 
ox + b4> 

If OT in (12) were zero, F would be divisible by ax + b<l> and the 
quotient of (10) by ax + b(t> would be divisible by/(a:), whereas 
its degree is ^ 4. Then if the numerator of the preceding fraction 
is zero for either value of ci, (11) shows that F == 0, and thus (10) 
shows that ox + is divisible by/(*), contrary to its degree. 
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m(ax + 6 <^) ^ (dx + e^)* — c(ox + [mod/(a!)I. 

Hence, by division, 

fJZ 

—-^ = dx-he^-t^(ax + b4>) [mod/(a;)]. . 
y + c* 

By (9), this implies 

S * = 0) 13 2® = 0 if 2 = l/(j/ + d). 

Hence if we replace y by (1 — 2 c^)/z in (14), we obtain 

(1 — acl)® + oi z (1 — zcl)* + Os z®(l — zc*)* 

+ flS Z*(l — ZC*)^ + O 4 Z^(l — ZCi) + 08 z® = 0, 

in which the coefficients of s* and z® must vanish. Thus 

5c® — 4ai c* + 302 c — 2as c* + o* = 0, 

— 10 c* + 601 c — 802 c* + Os = 0 . 

Since these hold for both values of c*, 

6 c® + 3 o 2 c + 04 = 0 , 4ai c + 203 = 0 , 

10 c + 3 o 2 = 0 , 601 c + Os = 0, 

whence 01 = 03 = 0 , 02 ==—f 10c, o* = 5c®. Write c = — 30. 
Then (14) becomes 

(16) y® + 10Cy» + 45C® y + Ob = 0. 

This Brioschi® normal form is especially adapted for solution® 
in terms of elliptic functions. The effect of writing y = T in 

^ Annali di Mat., 1, 1858, 256—9, 326^. 

®Kiepert, Jour, fur Math., 87, 1879, 130. 
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(15) is to put C = 1. However, by a transformatioix not involving 
radicals, we may pass from (15) to a quintic equation having a 
single parameter ( §131, end). 

The only irrationality employed in the transformation (13) of 
the principal quintic/(a:) = Oto the form (15) was the square root 
of the discriminant of (5). We may express the latter in terms of 
the discriminant H* of f(x). By definition, U is a determinant 
whose Jbth row is 1, Xi, 3?h, In view of (3), D is equal to 

the determinant whose Aiih row is 


1, h(irjb). 

Let D' denote the determinant having these elements in the fcth 
column. Forming the product D'D by using the rows of D' and 
the columns of D, we see that 


= 


5 0 0 0 0 

0 0 sg 0 0 

0 Sg S4 2 2 

0 0 Ugh 

0 0 Y.gh 2 ^* 


5S8* 


2 2 gh 

2ffA 


Hence the discriminant of equation (5) is equal to iD^/s^K 


Theorem^ 3. If we denote the product of the sguares of the differ-^ 
ences of the roots of the principal quintic f(x) ^ 0 by 5®A, that 
equation can be reduced to the form (15) by a Tschimhaus trans- 
formation whose coefficients involve the single irrationality A*. 

Brioschi^ proved the existence of ^(n — 1) polynomials <l>, 
Xi[^ = 3)] such that the sum, sum of squares, and 


^The above proof is a material simplificatioii of that by Gordan, Math. Annalen* 
28,1887,162—166 (Jour, de Math., (4), 1,1885,465-8). He introduced an unnecea- 
sary irrationality, ^e square root of the discriminant of F, since he used instead of 
(13) a transformation involving the linear factors of F, The part of the proof be¬ 
ginning with (11) follows Weber, Algebra, ed. 2, 1898, I, 264r-7. 
sRendiconti 1st. Lombardo, (2), 20, 1887, 364-70; Opere Mat.,. HI, 293-9. 
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sum of products of any two, summed for all the roots of the equa¬ 
tion of degree n, are all zero. This was proved in (9) above when 
n = 5. 

In his Chicago thesis to be published soon, R. Garver proves 
that every Tschimhaus transformation of a sufficiently genei^ 
principal equation of degree n into another principal equation 
can be expressed in the form y = ux + vtf>, where x and 4> are 
polynomials of degrees n — 1 and n — 2 which satisfy (9). 



Chapter XTTT 


GEOTIPS OF THE BEGTILAB. SOLIDS; 

QTJINTIC EQUATIONS 

121. Introduction. The first part of this chapter gives a theory 
of the invariants of each finite group of linear transformations on 
two variables. These groups are isomorphic to groups of rotations 
leaving unaltered the various regular solids. The final part is an 
application to the resolvents of the general equation of the fifth 
degree and furnishes a method of solving the latter. The subject 
plays an important rdle in the theories of elliptio modular func¬ 
tions and automorphic functions. 

The theory is due to Klein, whose book^ is a classic, but pre¬ 
sents difidculties to beginners on account of the introduction of 
ideas from many branches of mathematics. We shall give a aim pla 
exposition of the essentials of this interesting theory. 

122 . Linear fractional transformatiou on z corresponding to a 
rotation. To readers acquainted with the theory of functions of a 
complex variable there is available* a short proof of the desired 
formida (7). We shall give here a strictly elementary proof. 

We first define the stereographic projection of a sphere upon a 
plane. Employ a rectangular coordinate system in space. The 
sphere with radius unity and center at the origin 0 has the equa¬ 
tion 

(1) 5* + 1?* + f* = 1. 

From the point W = (0,0,1) project an arbitrary point 
■P = (Ij It) t) of the sphere to a point Q = (x, y) of the f1^plane. 

^Vorlesungen fiber das Ikosaeder, 1884. 

^Burkhardt’a Funktiontheoretasohe Yorlesungen, I, 2, 1903, 49-62 (EngUsb 
transl. by Rasor, 81). 
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Let T be the fourth vertex of the rectangle having sides ON and 
OQ. Let/SPF be parallel to JVO. Let PffandQfTbeparalldtothe 
j^axis. Then 00 = FG — ri, PF — f, OH = sc, QH = y. Since 
the triangles NSP and NTQ are similar, 



Fig. 1. 


1 - ^ SP : TQ = NS :NT = OF :0Q= ij ry = J 

Hence 

(2) sc = 7-^, y = ^ 


By (1), 

1 + »* + = 
whence 


1 - f' " 1 - f 

(1 - r)* + s® + u* 1 - 2r +1 2 


(1 - f)* 


(1 - f)* 1 - f’ 


(3) I = 2x/r, rj = 2y/r, f = 1 — 2/r, r = 1 + »* + 2/®. 

Write s = X + iy. Formulas (2) and (3) establish a one-to- 
one correspondence between the points of the unit sphere and the 
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points of the complex a-plane, provided we agree to identify all 
points at i n fin ity in the z-plane (so that z = co alone corresponds 
fcoA^). 

Let E = (X, n, v) and P — (|, ij, f) be any points on the sphere 
having radius unity and center at the origin 0. Write 


a 


(4) a = X sin -> 
2 

whence 


, .a • « j ^ 

0 = /* Sin c ^ vBm-} d — cos 
2 2 2 


(5) 


a* + 6* + c* + = 1. 


Consider the rotation about an axis OE through an^e a counter¬ 
clockwise when viewed from E toward 0. Let it replace (f, i?> f) 
by (S'j v'} f')- We employ the formulas. (28) of §46 which repre¬ 
sent the rotation, with %, oz, Oz replaced by a, b, c, and wi, m, wt 
replaced by v, and pi, pz, p* replaced by g', if', f 
Write 3 = X iy, w = X — iy. Then (3) become 


z-{- w i(w — z) ^ zw — 1 

- ■ , ^ _ —- - - , -- - - 

l + 2!t0 1 + ZU> 


Hence 

i(l + 2M>)(1 — f') = c® + d* -f ot(1 — — d^) + ez + ew, 

where e = bd — dc + i(ad -f- be) = (6 -|- ia) (d -f ic). Replac¬ 
ing 1 — c* — d® by the equal value a* H- 6®, we get 

i(l + nw) (1 — f') = [(&■+■ ia)z -f d — ic]f, 
f = (b — ia)w -t- d -|- ie. 

Similarly, 

^(1 -j- 2w) (I' -h it]') = [(d -f ic)z — (6 — ia)]f. 
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( 6 ) 


2 = 




2' = 


+ W 


(7) 


1-f' " 1-r 

(d + ic)z — (& — ia) 


(b + ia)z + d — ic 
Thbobem 1. Every rotation corresponds to a transformation (7). 


Exbbciseib 

1. By (4) and (7), with X = ju = 0, v = 1, prove that the rotation about 

the ON through angle a counterclockwise viewed from N is repre¬ 

sented by 

(8) s' = e*“ 0 . 

2. Prove (8) by use of the formulas from analytic geometry, 

= { cos a — ij sin a, v' = S sin a + n cos a. 

123. The tetrahedral group. Consider a cube inscribed in a 
sphere with the center 0 and radius unity. Four of its vertices 
1,2,3,4, shown in Kg. 2, are the vertices of a regular tetrahedron. 
Their diametral points 1', 2', 3', 4' are the vertices of a second 
regular tetrahedron called the dwmetrcd tetrahedron to 1234. 

The mutually perpendicular lines AA’, BB', CC', each of 
which joins the mid points of a pair of opposite edges of 1234, 
are taken as the J-axis, i;-axis, f-axis, respectively, their positive 
directions being from 0 to A, B, C, respectively. 

The tetrahedron 1234 is unaltered by a rotation through angle 
0®, 120°, or 240° about any of the lines 01, 02, 03, 04 as an axis, 
and also by a rotation through 180° about one of the axes AA\ 
BB', CC'. These include the identity rotation (through angle 
0°), eight rotations of order 3, and three of order 2, They give all 




Fig. 2. 
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the rotations leaving' the tetrahedron 1234 unaltered and hence 
form a group called the tetrakedrd group Rn. For, exactly three 
such rotations leave the vertex 1 unaltered, while their products 
by any one rotation (about axis 04) which carries 1 to 2 give aU 
the rotations which carry 1 to 2, and similarly when we use 3 or 
4 instead of 2, whence the total number is 3 X 4. 

Consider in particular the rotation S about the axis 01 through 
angle 120° counterclockwise when viewed from 1, and the rotation 
R about the axis 00" through angle 180°. Since R interchanges 
1 and 4, RSR and R^R leave 4 unaltered and hence give 
rotations about the axis 04 which carry 1 to 3 and 2 respectively. 
By the final remark in the preceding paragraph, the identity, S, 
and /S*, together with their products by RS^ R, RSR, and R 
(which carry 1 to 2,3, and 4 respectively) give all the rotations of 
■Bju* 


Theorem 2. The tetrakedrd group Rn of dl rotations which 
home undtered the regular tetrahedron 1234 is generaied by the two 
rotdions S and R. 

We shall next obtain by §122 the linear fractional transforma¬ 
tions on 2 which correspond to the rotations S and R. In the case 
of R, we have merely to employ (8) for a = ir to obtain a' = — s. 
Next, the axis 01 of rotation S makes the same acute an^e with 
the positive directions of the coordinate axes and hence the 
coordinates of point 1 are \ = n = v = 3"^. Since a = 120° and 
cos 60° = sin 60° = ^ • 3*, (4) gives o = & = c = d = i. 
Then (7) becomes 


(9) 


z + i 
z — i 


Theorem 3. The group Fvt of dl tetrahedrd linear fractional 
transformations is generated by {9) and z' — — z. 

To find the product AB of a transformation A defined by (7) 
by a second transformation B :z' = fiz), we write the latter in 
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the form = /(2O and eliminate 0' between the final equation 
and (7). The resulting transformation 2" = is defined to be 


AB. 


Exercises 

1. The group Rn is simply isomorphic to the alternating group on the 
letters 1, 2, 3, 4 denoting the vertices. 

2. The 12 transformations of Fn are 


( 10 ) 




,1 ..s + l ,.2 — 1 ,2-fi 

± Zf ±^ -r» ± ^ ——f ± - 

Z 2 — 1 2 + 1 2—t 


± 


2 — i 

2 + t 


Hints: is given by the third function (10) with the upper sign, 

while L s BS is 2 ' = I/ 2 . The last may be computed as a product or 
proved by noting that L leaves A fixed and hence is the rotation through 180® 
about axis AA', Tor X = l, = 0, « = 180°, (4) gives a « 1, 6 » c 
as d =* 0, whence (7) becomes 2 ' =* i/ (i z). 

3. If R (or z' s=s — z) is the rotation through 180° about axis CC' and if 
L (or 2 ' as i/z) is that about AA*, their product M ** RL (or 2 ' — I/ 2 ) 

is the rotation through 180° about the common perpendicular BB' to CC* 
and AA\ Hence show that iS, L, ilf, and the identity rotation form a com¬ 
mutative group of order 4. 


To each transformation (7), of determinant unity, corresponds 
the linear homogeneous transformation 

V U = (d + ic)u — (6 — ia)v, 

= (b i(i)u + (d — ic^Vf 

of deteminaiit unity, as well as that obtained by obang in g the 
signs of a, b, c, d, which is therefore the product of (11) and 
U = — w, F = — w, in either order. 

In particular, to z' = ia/(— i) and (9) correspond 

( 12 ) 


(13) 


U = iu, V = — iv, 

V = §(1 + i)u — J(1 — i)v, 
F = 1(1 + i)u + §(1 — i)v. 
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Thbobebi 4. The group of all tdrahedrcd, homogeneous linear 
transformations is generated by (12) and (13), each of determinant 
unity. 

124. Invaiiants of the tetrahedral group H24. The vertices of 
the tetrahedron 1234 have the coordinates 

(14) { = ± 3-i, ^ = ± 3-i, f = ± 3-i, 

where an odd number of the signs are plus. By (6), the corre¬ 
sponding values of 2 are ± a and ± b, where 

1+i , 1-i 

® 3*-l’ ^ 3*4-1' 

We seek the product # of the factors u T u 4^ 5o. Since 

a? = i (2 3*) and 6® = — i(2 — 3*) have the product 1 and 
sum 2 • 3H', we get 

(15) $ = u*-2*3*w*r»4-«^. 

Since the 12 rotations which leave the tetrahedron unaltered 
merely permute the four vertices, each of the 24 tetrahedral homo¬ 
geneous transformations merely permutes the above four linear 
functions, apart from constant factors, and hence leaves $ un¬ 
altered up to a constant factor. That factor is evidently 1 for (12), 
while for (13) it is 

ft(l + f)]ni - 2 • 3* f 4-1) = |(- 1 4- 3* 0 = e®'*''®- 

Hence is an absolute invariant of the group Hu- 
The vertices of the diametral tetrahedron 1'2'3'4' have the 
coordinates (14), where now an odd number of the signs a,re 
minus. The corresponding values (6) of 2 are ± d and ± b. 
Hence the group has the absolute invariants and where 


(16) 


’S' = u* -f 2 • 3* w*4- »*. 
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Any rotation which leaves the tetrahedron 1234 unaltered mere¬ 
ly permutes its six edges and hence their middle points. It there¬ 
fore permutes the points in which the axes of coordinates intersect 
the unit sphere; the corresponding values of z are evidently 0, oo, 
± 1, ± The product of the linear functions which vanish for 
these values of z = u/v is 

(17) t = uv(u^ — v^). 

It is an absolute invariant of the group, being unaltered by (12) 
and (13). 

A point on the sphere is called a special point with respect to 
the group of rotations leaving the tetrahedron 1234 unaltered 
if the point takes fewer than 12 distinct positions under these 12 
rotations. In other words, a point is a special point if and only if 
it is unaltered by at least one of those rotations other than the 
identity, and then the point lies on the axis of the rotation. 

The axes AA', RS', CC* of the rotations of order 2 are the 
coordinate axes. They intersect the sphere in the six special points 
for which the invariant t vanishes. 

The eight rotations of order 3 of jf2i2 have as axes the diameters 
through the vertices. The latter are the four special points for 
which the invariant $ vanishes. The remaining intersections of 
those diameters with the sphere are the vertices of the diametral 
tetrahedron and hence are the four special points for which the 
invariant vanishes. 

Consider any homogeneous polynomial in u and v which is an 
invariant of If it vanishes for a special point, it has one of the 

factors tj After removing all such factors, we obtain an 

invariant quotient Q which vanishes for no special point. Since 
Q (w, v) is homogeneous in u and v, the equation Q(z,l) =0 has a 
root z'} so that Q(w, v) vanishes for w = ?7, v = U, where U/V 
— z . Let / and g denote any two of the absolute invariants 

each of degree 12. Then also / — cgr is an absolute invari¬ 
ant, if c is any constant. The condition that it shall vanish at 
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JJ, V uniquely determines e, since z' is not a special point and 
therefore g(U, V) 0. Hence Q has the factor / — eg. The 
quotient is either a constant or has another such factor. Proceed¬ 
ing similarly, we have 

Thbobem 5. Every homogeneous polynomial in u and v which is 
invariant under the 24 tetrahedral homogeneous linear trari^orniatioris 
is a product of factors t, '^,f — eg, where the c^s are constants 
9 ^ 0 , whde f and g denote any two of the absolute invariants 
SEf® of degree 12 . 

Hence an invariant of degree 12 which vanishes at no one of the 
6 -t- 4 -h 4 special points can be expressed in each of the forms 
a{f — c^), b(f — d^), where a, b., c, d are constants 5 ^ 0 . 
Thus f*, and satisfy a linear identity. By considering the 
terms u^ and u^ v, we find 

(18) 12 (-3)» 


Exebcisbs 

1. If 8 and z' are the steieographic projections of two diametral points on 
the unit sphere, zz' =* — 1. Hence derive (16) from (16) by replacing z 
by —1/8 in 5 (m/v)>=0. 

2. The TTftaaiii.n of * is -48-3* i W. The Jacobian of and * is 32-3* «. 
When 0 is taken as the f undament al quartio form, the invariant / is zero, while 
jr = _ 4/(3-3i i). The syzygy (22) of §12 reduces to (18). 

125. The octahedral group. In Mg. 2, the solid composed of 
the two pyramids with the vertices C and C' and common square 
base ABA'B' is a regular octahedron. It is unaltered by the 
rotations of the group Bn which leaves the tetrahedron 1234 
unchanged, together with 12 rotations which interchange the 
latter and its difunetral tetrahedron 1'2'3'4', viz., the 6 rotations 
through ± 90° about the axes AA', BB', and CC", and the 6 
rotations through 180° about the 6 diameters each bisecting two 
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edges of the octahedron. Products of these rotations replace A 
by A\ By B'y Cy C'; since only 4 rotations about OA leave the 
octahedron unaltered, there are only 6X4 rotations leaving it 
unaltered. Hence all rotations leaving the octahedron unaltered 
form a group 2224. It is generated by R and S of §123 and the 
rotation E about the f-axis through 90® counterclockwise when 
viewed from C. By (8), E is represented by 2 ' = iiSy one of 
whose two homogeneous forms is 

(19) U = 2“K1 + y = 2-^(1 - i)v. 

Theorem 6. All rotations having unaltered a regular octahedron 
form a group Bu of order 24. The octahedron group H 4 & of linear 
homogeneous transformations is generated by (19) and the two genera-' 
tors (12) and (13) of the tetrahedral subgroup H 24 . 

126. The invariants of the octahedral group EUs* Consider the 
octahedron whose vertices are the intersections of the three 
coordinate axes with the unit sphere. 

Since the function t given by (17) vanishes only for the six 
values of z which correspond to the vertices, t is an invariant of 
H 48 . We saw that t is unaltered by all the transformations of 
Under transformation (19), ^ becomes — t Hence f is an absolute 
invariant of Ha&. 

The Hessian of t is — 25TF, where 

( 20 ) 17 ==^® + 14u^ ^ ^ 

The Jacobian of t and W is the product of 

( 21 ) ^^u^iA - ZZu^v^ + 

by a constant. Since each transformation of Hab is of determinant 
unity and replaces i by ± the properties of the Hessian and 
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the Jacobian (§§3, 5) show that the transformation leaves W 
unaltered and replaces x by ± %. 

A point on the unit sphere is called a special point if it is un¬ 
altered by at least one rotation, other than the identity, of J 224 
and hence lies on the axis of that rotation. We saw that the special 
points on axes of rotations of the tetrahedral group jBi 2 are the 
points for which ^ or TV « ^ ^ vanishes. By §126, the only fur¬ 
ther special points are the central projections on the sphere of 
the mid points of the 12 edges of the octahedron. For these points 
X must vanish. In fact, since x is an invariant it vanishes for 
some set of 12 special points which is distinct from the set of the 
six vertices each counted twice, since (cf. Ex. 3 below). 

The discussion at the end of §124 therefore leads to the 

Theorem 7. IBvery homogeneous polynomial in u and v which is 
invariant under the group of 48 octahedral linear homogeneous trans-^ 
formations is a product of factors t, TV, x? / *“ where the c^s are 
constantsf while f and g denote any two of the absolute invariants t^, 
TV®, X* of degree 24. The latter satisfy the linear identity 

(22) X08«* - TT® + X® = 0. 


Exercises 

1 . Why is invariant under fTia? 

2. R 24 is simply isomorphic to the symmetric group on 4 letters representing 
the four diagonals 11',..., 44' of the cube. 

3. The diameter which bisects the edge joining (0, 0,1) and (1, 0, 0) 
meets the unit sphere at (i2 0, J21), whose s is 1 -h 2 K For this value of 
u and for 1 ; = 1, x =* 0* The invariant x therefore vanishes at the central 
projections on the sphere of the mid points of all 12 edges. 

4. There exists a fundamental system of invariants of any finite group G 

of linear transformations. By BHlbert's theorem { §17) every invariant / of 
O is expressible in the form 7 ^ Jffi Zi + • • • + 7m, where the 7,* are in¬ 

variants. Apply the transformations of G and add. Why does this proof hold 
also for relative invariants? 
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127. The icosahedral group. Consider a regular icosahedron I 
inscribed in a sphere having the center 0 and radius unity. It has 
20 triangular faces, 12 vertices, and 30 edges. If V is any vertex, 
I is unaltered by the rotation S about the axis OV through angle 
2t/6 counterclockwise viewed from 7. Letter the triangular 
faces 4o VAij Ai VA 2 ,. • •, Ai VAo having the common vertex 
V so that S carries Ao to Ai, Ax to A 2 ,... , -4.4 to Ao, The rotation 
through angle tt about the axis joining 0 to the mid point of an 
edge AB will be denoted temporarily by {AB\\ it leaves I un¬ 
altered. Write T for [Ao 7]. Since S carries the mid point of 
Ao 7 to that of Ax 7,5“^ TS leaves the latter point unaltered and 




§127] 


IC08AHEDRA.L GROUP 


233 


hence is [Ai V]. Similarly, T8^ = [Aj 7]. We shall speak 
of these five rotations [A,- V] of order 2 and the five powers of S 
as the ten rotations connected with vertex V. They are trans¬ 
formed into the ten rotations connected with vertex A* by 
[A k 7], which replaces 7 by A A glance at Fig. 3 shows that the 
vertex 7', diametrically opposite to 7, is the only vertex which 
cannot be reached by an edge starting from one of the vertices 
Ao,.. ., A4. If Ao B is either of the two edges having Ao as one 
end’ point and distinct from A0 A4, Ao 7, AoAi, the rotation 
[Ao B] transforms the five rotations about the axis OAq into the 
five rotations about the axis OB, and one of the latter carries Ao 
to 7'. We have now proved that suitably chosen products of S 
and T carry 7 to all 12 vertices. The five powers of S are the only 
rotations of J into itself which leave 7 unaltered. Hence there 
exist exactly 5 X 12 rotations of I into itself. 

Theorem 8. S and T generate the icosaJiedral group Reo of all 
rotations of I into itself. ’ 


Exercises 

1. Since T interchanges V and Ao, as well as Ai and A 4 , the product 
ST = (7Ao A 4 ) • • • is a rotation of order 3 whose axis joins 0 to the mid 
point of face 7Ao A 4 . 

2 . The proof that S and T generate the ten rotations connected with any 
of the five vertices A shows similarly that they generate the ten rotations 
connected with any of the five vertices of type B and the ten connected with 
7'. 

3. Hence S and T generate the 6 X 4 rotations of order 6 about the 6 
diameters joining pairs of opposite vertices, the 16 rotations of order 2 about 
diameters each of which bisects two edges, and (Ex. 1) the 2 X 10 rotations 
of order 3 about diameters each of which passes through the mid points of 
two faces. These with the identity rotation give the 60 rotations of Eeo. 

128. The linear homogeneous icosahedral group H 120 - Choose 
rectangular coordinate axes so that the positive f-axis extends 
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from 0 to 7 and the axis of the rotation T lies in the second and 
fourth quadrants of the ^f-plane. By (8), rotation — ez, 

one of whose two homogeneous forms is 

(23) V=^v, 

The vertex Aq is represented by a real negative number n. 
Since 8 carries Ai to Af+i, the vertices Ai, A 2 , -As, A^ are repre¬ 
sented by Since T interchanges Aq and 7, 

which are represented by n and 00 , and interchanges the. diamet¬ 
rically opposite points, represented by m = — rr^ and 0, we see 
that 

m , nz+I 

T : 2 ;' - 

z -- n 


Since T interchanges Ai and Aa, z = en implies z' = n. Hence 
n2(l — € — — 1, or n^(€ + = 1, But € + ^ is positive 

and is negative. Hence ?i(e+ = — 1. Applying the powers 

of S (z' = 6z) to m and n, we conclude that the 12 vertices of the 
icosahedron are represented by 

2 ! « 0, 00 , €* w, 6* n 

^ ^ 1, 2, 3,4; m-€+eS n = 

Multiply the numerator and denominator of T by g* — c®; we 
get 

m. ; ^ (e- ^)Z + ^ ^ 

(g® — ^)z — (g — g^)^ 

whose determinant is 5. Hence a homogeneous form of T is 


(25) 


5* ?7 = (e — ^)u + (g® — 
5^7= (g®- (e- 
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As a check, the square of the latter isl7= — m, 7=-t). 

Thbobem 9. The icosdhedral group Hm is generated try (23) and 

(26). 

129. The invariants of the icosahedral group H 120 . The form 
which vanishes at the 12 values (24) of 2 = u/y is 


/ = ««(«' — m®«®) (u® — ra®«®). 


We desire for r = 5 the sum s, of the rth powers of the roots m 
and nofa^ + a: — 1 = 0. Multiply by a;’’"®, insert the roots, and 
sum; we get 

S, + Sr-l — Sr_a = 0. 


Using this recursion formula, we get 


Ss = — S 4 + S 3 = 2 s 8 — S 2 = — 3 s 2 + 2 si = 5si — 6 = — 11 . 


Since mn = — 1, we have 

(26) / = m!;(m“ + llu® ti® — «“). 

The Hessian of / is 121fl', and the Jacobian of / and U is 20r, 
where 

(27) E - - uM _ ^0 + 228(u“ a® - it® «“) - 494u“ 

(28) !Z’=u®®+u»“+522 (u®®»®-w® t>®®) -10005 (u®« v*®). 

A special point of the sphere is defined to he a point which takes 
fewer than 60 positions under the group fiso. Hence it is a point 
on the aids of one of the rotations other than the identity. The 
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enuDcieratioii of these rotations in the preceding Ex. 3 shows that 
every special point belongs to one of the following sets: the 12 
vertices (at which / vanishes), the central projections on the 
sphere of the mid points of the 20 faces, and those of the mid 
points of the 30 edges. An aggregate of two or more sets, not 
necessarily distinct, never gives a set of 20 or 30 points. Hence 
the invariant E vanishes at the central projections on the sphere 
of the mid points of all 20 faces, while the invariant T vanishes 
at those of the mid points of all 30 edges. 

Evidently/is unaltered by transformation (23). Computation 
shows that / is unaltered by (25), and hence is an absolute in¬ 
variant of Em* The same is true of E and T since the trans¬ 
formations have determinant unity. Proceeding as at the end of 
§124, we obtain 

Theorem 10. Every homogeneous polynomial in u and v which is 
invariant under the icosahedral group Em is an absolute invariant 
which is a polynomial in the invariants E, and T, The latter satisfy 
the identity 

(29) = 1728/5 - E\ 

130. The form problems for the tetrahedral and octahedral 
groups. By §124, every polynomial absolute invariant of the 
homogeneous tetrahedral group H 24 is a polynomial in the absolute 
invariants t^W — and which satisfy the identity 

(30) 4«3[12(- 3)^ t^+ 0. 

Given arbitrary values for t, W, compatible with relation (30), 
we seek the sets of values of w, v. This is the form problem for 
Ezi* Prom one set of solutions u, u, we evidently obtain 24 sets 
by applying to it the 24 substitutions of H 24 . That there are ex¬ 
actly 24 sets of solutions is seen as follows: Write 


(31) 


Zi — X(u, v) = W/ty 
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so that Zi and X are the given values of these absolute invariants. 
The first quotient is a function of 2 = u/v. The corresponding 
equation 

(32) (2* - 2 • 3* + 1)® - Zi(2* + 2 • 3i iz^ + 1)* = 0 

has 12 solutions s. For each such 2 , we may compute X( 2 ,1) and 
then find two sets of values of u, v from 

(33) — X{u, v)/X(z, 1), u — zv. 

The fiirst relation (33) follows from the fact that the degree 8 of 
W exceeds the degree 6 of t by 2. The form problem for Hn there¬ 
fore reduces to the solution of equation (32) and extraction of 
square roots. 

Equation (32) is invariant under the group F-m of aU tetrahedral 
linear fractional transformations on 2 . Four of them (of. Ex. 3, 
§123) are 2 ' = + 2 , 2 ' == ± 1/s, and these leave invariant the 
function 


(34) Z 2 = (s* - 1)V(42*). 

Employ the cube root m = |(—1 + 3 N) of unity. Then 
Zi-u’= (s* - 2 • 3» 22* + 1)/ (4s*), 
while Z 2 — M* is obtained by changii^ ito — i. Hence 


(35) 


( Zi ~ w 

Z* - CO* 



Hence (32) may be solved by extracting a cube root (35) to get 
Zi, solving the quadratic equation (34) in s*, and extracting the 
square root of the resulting value of 2 *. 
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For the form problem of the octahedral group Hm, we are given 
arbitrary values of the absolute invariants W, tx, compatible 
with the relation (22) or 

<“(108<* - F») + (t xT ® 0, 

and seek the 48 sets of values of u, v. Write 

(36) Z = X(u, v) = W tlx- 

Since X is of degree 2 in w, ti, we again have (33). For Zi = $*/’*'*, 
we see from (30) and W = that 

(37) -3 -144^1/(21-1)* = ^. 

The first equation (36) is of degree 24 in the unknown z. Its 
solution is equivalent to the solution of the quadratic equation 

(37) for Zi and the earlier equations for finding z from Zi. 

Thbobbm 11. The form ■problems for the tetrahedral and octor 
hedral groups are sohcAle by radiads. 

\ 

131. The form problem for fire icosahedral group Huo- Given 
arbitrary values of the absolute invariants/, H, and T, compatible 
with relation (29), we seek the 120 sets of values of u, v. Write 

(38) Z = f/'P, X{u, a) = / H/T. 

Since X is of degree 2 in u, a, we a^dn have (33). Hence the forin 
problem for Hm reduces essentially to the solution of f* — Z 
= 0, which after division by a“ becomes 

s«(aw + Ha* - !)« - ^[s“ + 1 + 522(a“ - a*) 

- 10005 (a®" + a“)? = 0. 


(39) 
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TIus equation of degree 60, having the single parameter Z, is 
called the icosdhedral eguaiion. When Z is arbitrary this equation 
will be shovm to be not solvable by radicals, in contrast with the 
form problems for the tetrahedral and octahedral groups. Our 
6 rst step will be to reduce the solution of (39) to that of a remark¬ 
able quintic equation. 

In (26) we gave the function f which vanishes for the twelve 
values of 2 = ufv which represent the vertices of the icosahedron 
I. By inspection, / is unaltered by the transformation 

(40) U ^ - V, 7 =«; 2 ' = - 1/s. 

Hence the corresponding rotation F leaves I unaltered, so that 
(40) belongs to the group Hm- In §§127-8, we defined the rota¬ 
tion T through 180® about the axis OP in the |f-plane, where P 
is the mid point of the edge Ao 7. By Ex. 3, §123, F is the rota¬ 
tion through 180® about the i^-axis, and the product TF is the 
rotation through 180® about the axis which is a common per¬ 
pendicular to Oij and OP. 

To find the points unaltered by (40), take s' == s, whence 
s* H- 1 = 0. To find the points unaltered by T, write (25) in the 
non-homogeneous form and take s' = s. Similarly for TF. In 
the resulting three equations, we replace s by u/v. Hence the 
pairs of points unaltered by F, T, TF are respectively those for 
which 

Ao = «*-{-a*, Bo = «* - 2(6*-f- 

Co = u* - 2(6 + ^)w - a* 


vanish. Hence the points for which 

(41). <0 = AoJSoCo = u« -b 2w»o - 5w*a* - 5m* a« + 

vanishes correspond to the vertices of a regular octahedron. Thus 
to is the fundamental invariant of the octahedral group referred 
to axes different from those used in §126. By (23), we have 
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(42) S*: V=^’‘u, 7= 

This replaces Ao, Bo, Co by respectively 


Ah = e*w® + «** Bh= M® — 2(6® + — 6^*»®, 

0* = 6* W® — 2(6 + ^)uv — 6^*B®. 

Write <4 = AkBh Ch- For T defined by (25),^ 

UV = i-l(M® + W-V^), Z = (6® - 6®) (6 - 6«) = 5i. 

We readily verify that T replaces Ai by mBo, where 

TO = |(2e- 6®+ «®-26^), 

Cl by (^ — 6 ®)j 4«, and Ci — Bi = — 2luv by 

— 2(w® + wi; — «®) = (e® — €®)il2 — (6® + «®)C2. 

Hence T replaces Bi by (^ + 6 *)C 2 . Since m( 6 ® — 6 ®) ( 6 ® + e®) 
= 1, r replaces ti = AiBi Ci by th. This implies that T replaces 
U by U since A-h, B-h, C-h are derived from Ah, Bh, Ckhy replac¬ 
ing 6 by 6 “®, which merely changes the signs of aU coefficients of 
T, a change not affecting quadratic functions of u, v. 

Since T permutes the end points of the axes of F and TF and 
leaves those of T unaltered, T changes Ao and Co into their negar 
tives and leaves Bo unaltered. Hence, since T is of order 2 , 

(43) T — (fe) (Zi Zj) (4i U), S = (.totitits U)‘ 

Theobbm 12. 7'he icosahedTol growp of 60 rotatioTis or of 60 
linear fractiond transformations is simply isomorphic, to the alter¬ 
nating group on fwe letters. 
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This may be seen geometrically. The 15 diameters each of which bisects 
two edges of the icosahedron fall into 5 triples each of three mutually per¬ 
pendicular diameters Oike the axes of F, T, TF). Each triple can be converted 
into all the remaining triples by rotations of Fm- Hence exactly 60/5 or 12 
rotations of JSeo leave a given triple unaltered. The latter form a subgroup of 
the octahedral group JR 24 , whose rotations of order 4 are not in JSeo* Hence 
this subgroup is a tetrahedral group Ru. From (43) it is easy to pick out the 
12 substitutions on io,..., U which leave U unaltered and hence correspond 
to the subgroup Rn of Rw. For example, STS^ « (0)(2)(143), T, and 
F - TS^ TS^ T =s (0) (14) (23) generate the homogeneous form Hu of Eia- 

By (43) every transformation of the icosahedral group Hvto 
permutes , ^4 amongst themselves and hence leaves unaltered 
their elementary symmetric functions. Thus tof,.. are the 
roots of a quintic equation 

+ Cl + ^2 + C 3 + C 4 ^ + Cs = 0, 

where is a polynomial of degree &k in u, v which is an invariant 
of Hm> The largest 6k is 30 and hence is < 60. Thus by §129, 
Cfe is a product of factors/, H, T, whose degrees are 12, 20, 30. 
Since no such product is of degree 6 or 18, we have Ci = cs = 0. 
Also, C 2 = a/, C 4 = bp, C 6 = cT, where a, 6, c are constants. 
Letsjfebethesumof the fcth powers of the roots. Since Ci = cg = 0, 
two of Newton’s identities reduce to S2 + 2c2 — 0, S4 + C2 52 
+ 4 c 4 = 0. Thus 

(44^ 52 + 2a/ = 0, S4 + (4b - 2a^)P = 0, 

cT 4“ ^0 fe fo ^4 = 0. 

Since in (42) replaces k by itt, we have 
(46) + + 

whence 

th ^ _ 6€^^u^^v^ + .... 

If'k = + 8e^ V + 4u®® + .. •. 
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The coefficients of vP- v, ti® in the respective functions 
(44) are 

20 + 2a = 0, 20 + 4b - 2o* = 0, c + 1 = 0, 

whence a = — 10, b = 45, c = — 1. This proves 
Theobem 13. The octahedral forms U, •.. ,U are tbe roots of 

(46) ^-10/<» + 45/*<-r = 0. 

This equation was first obtained by Brioschi (§120). 
Introduce in place of the root t of (46) the new unknown 
w — tp/T, which is of degree zero in u and v and hence is a func¬ 
tion of z. In (46), replace t by lo T/P, multiply all terms by 
P^fTP, and write Z iotPITP, in accord with (38). We get 

(47) - 10Zio» -f- 45Z2 to - 2* = 0. 

132. The principal quintic resolvent of the icosahedral equa¬ 
tion. We employ the functions h defined by (41) and (46). The 
Hessian of is 400 W », where 


(48) 


TFjfc = — M* + 6** 0 — 7 e** u* 0® — 7 6* 0® 

-f 7€‘* U* 0® - 7«®* ti® t;« - - 6* 0®. 


Since the generators T and S of the icosahedral group Hm are 
of determinant unity and permute it,... ,U according to the 
substitutions (43), they permute their Hessians and hence 
Wo,..., W 4 according to the same substitutions. Thus 
t jT TFjt* is unaltered by T and S and is therefore an invariant 
of H 120 , and hence is a polynomial in /, H, T, whose degrees are 
12, 20, 30, respectivdy. No linear combination of the latter with 
integral coefficients ^ 0 reduces to 8, 14,16, 22, or 28. Hence 
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Thus Sy = 0, 2}l^ = 0,if 

(49) 7* = aTf * + r h Wk (k = 0, 1, 2, 3, 4). 

Hence the latter are the roots of a principal quintie 

(50) P + 6oP + 567 + c = 0. 

By Newton’s identities, we have 2 P = ~ 15o, 2 
= — 206. An inspection of the degrees of the invariants yields 
the following identities except as to the prefixed numerical factors 
which were found by a comparison of single terms of each member: 

2 F» = - 5 • 24f, £ tip = - 67, 

= - 5 ■72f, 2 f» T7» = - 15/7, 

2 IF'* = 200, - 60PH, 

52 = - 5HT, £ <* 17* = - 540/»H. 

Also 22 = 0. Hence 

a = 8f <r* + 7o^ T + 72f <rT» +/7 t», 

6 = - 0a* + 18/* H<r* T* + H7(rT» + 27f HA 

To compute c, note that (46) gives 

- 10/a!» + 45/*a: - 7 s II(a: - <*), 

identically in x. For ® = — a/r, this gives 

II((r + Tfi) = (T* - IQfff* T* + 45/* ffT* + 7 t». 
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But H PF* = — the numerical factor being found by com¬ 
paring one term of each member. Hence 

c = - n - 10/<r* t 2 -b 4Sf <rT« + Ti^). 

To obtain a resolvent of the icosahedral equation (39), we pass 
to a quintic equation which involves /, H, T explicitly only in the 
combinations 

Z = p/T^, V = m/p, 

where 

(51) Z-i + 7 = 1728 


by the identity (29). To this end, write 

(52) V = X//H, r = y^P/iJET)- 


Ilien 

(53) 


■ 7 a = 8X» + X" M + (72X/t* + M*)^, 

■ 75 = - X*-H18X*/t*^+ Xm’‘2 + 27a*«Z 2, 
.7c = X' - 10X»+ 45 Xm*Z* + Z\ 




When these values of a, i, c are inserted into (50), we obtain the 
principal resolvent of the icosahedral equation. 

133. Identification of any principal quintic witii the principal 
resolvent. Given a, b, c, we shall prove that equations (51) and 

(53) can be satisfied by choice of X, fi, Z, 7. To the third equar 
tion (53) add the product of the second by X; we get the product 
of the fhst by Z, whence • 


(54) 


X6 “b c = /i® Za, 
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Multiply tlie third equatioa (53) by X and the second by ju® Z 
and subtract j we get 

(55) 7(Xc - 7h) = (X* - 2)*. 

From the first two equations (53), we get 

F(Xa + 86)/m = X» + 216X2 vZ + QXm* Z + 216m» 

Divide the square of this by Z and subtract the result from 27 
times the square of the first equation (63), and simplify by use of 
(51); we get V (X* — 3^* Z)®. Employing (66), we get 


27a2 - 


(Xffl + 86)2 
V?Z 


Xc — 2jb. 


El imin ation of /i® Z by means of (54) gives 

(56) ~ ~ 6 - oc® + 26* c) 

+ 64a2 6* - 27a» c - 6c* = 0. 

After finding a root X of this quadratic, we deduce /** Z from (54), 
V from (55), and then Z from (61). By means of the resulting 
value of jit* and the first equation (53), we can find /i rationally. 

The root X involves the discriminant A of (56). Let P denote 
the product of the squares of the differences of the ,roots of 
ffiy) = 0, defined by (50). It is known* that P is the resultant 
of g (F) and its derivative, whence P = 6® D, where 

D = 108a® c - 135a® 6* + 90a* 6c* - 320o6* c + 2566® + c*. 

We find ihat A = a* D. In the theory of invariants, D (and not P) 
is called the discriminant of (50). 


^Dicskaoii*s First Course in the Theory of Equations, 1922, 152. 



246 REGULAE SOLIDS, QUINTICS (Ch. XIII 

Hence equations (63) and (51) may be solved rationally for 
X, /i, Z, Y in terms of a, b, c, D^. 

Thbobbm 14. A general ‘principal guintic equation may be identi¬ 
fied •with the prindpal resolvent of the icosahedral equation. The ordy 
irrationality introduced in making this identification is the square 
root of the discriminant of the principal quintic. 

134. General quintic equation. By means of a transformation 
involving a square root, we may reduce the general quintic to a 
principal quintic (§118). We just proved that the latter may be 
identified with the principal resolvent in §132 in which X, p, Z, Y 
are constants; its roots are 


(57) 


_ 'KfW k uf^ tkWh 
~ HT 


(/fc = 0,...,4), 


which are of degree zero in u and v and hence are polynomials in 
z — u/v. Hence the general quintic equation can be solved in 
terms of two square roots and a root z of the icosahedral equation 
(39). 

The latter can be solved in terms of elliptic modular functions.* 
If with Klein we regard as known a root z= ^{Z) of the icosar 
hedral equation, we may compute the roots of the general quintic. 

135. Tschimhaus transformation of the special Biioschi re¬ 
solvent (47) into the principal resolvent of the icosahedral equa¬ 
tion. Tor the S 3 rstem of coordinates used in §§123-6, the values 
of u :» for which the Hessian of the octahedral form t vanishes 
correspond to points on the sphere which are on axes of rotations 
of order 3. Since the same is therefore true for the system of 
coordinates in §§128, 131-2, the Hessian W of the new form t 

1 Klein, Math. Annalen, 14, 1879, 157-8. Bianohi, Oni., 17, 1880-1, 264r-7. 
Klein, Ikosaeder, 1884, 130-^; Theoiie der Elli ptianhan Modulfunctionen, I, 1890| 
125. 
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vanishes for values of 2 corresponding to mid points of faces of the 
icosahedron. Hence IF is a factor of H. Another factor is — 3/. 
For, if we transpose the term T of (46), square and replace f 
by 3/, we get 3/(24/®)® s 1728f = T®, whence H = 0 hy (29). 
A comparison of the coefficients of one term of each member gives 

(58) H s F(«® - 3/). 


From (57) with the subscripts k suppressed, (58), t = w T/P, 
and Z ~ P/T^, we get 


(69) 


X "H fMW 


Theorem 16* The Techirnhaus transformation (59) replaces 
resolvent (47) by the principal resolvent of §132* 

Since (47) is a quintic well adapted for solution by elliptic 
functions, we may so solve the principal resolvent and hence any 
quintic* 

Exercise 

Give a direct proof that (59) transforms (47) into the principal resolvent. 
Apply Sylvester's method to eliminate Wj using (47) and its product by 
and the quadratic equation in w equivalent to (59) and its products by to, 

w®, w*. Equating to zero the determinant of these seven linear equations 
in , to, 1, show that we get (50), where a, 6, c are given by (53) and 

(51). 

136. The Galois group of the icosahedral equation. Let Feo de¬ 
note the group of the transformations 

a' = Liiz) (e - 1,,.,, 60) 

which are the linear fractional forms of the homogeneous trans¬ 
formations of the icosahedral group. In view of the generators 
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(23) and (25) of the latter, the Li(z) involve the single irra¬ 
tionality e. Let I (z) denote the left member of the icosahedral 
equation (39). Its coefficients belong to the field F — R{Z, e) 
and are invariant under the group Fw Hence if is any root of 
I ( 2 ) = 0, all its roots are given by 2 < = Li{zi) tori — 1,..., 60. 
Thus any factor of I ( 2 ) whose coefficients are in F vanishes for all 
60 roots, whence 1 ( 2 ) is irreducible in F. 

Since Feo is a group, Li[Lj{z)] s Lk{z). Hence the replacement 
of 2i by Zj gives rise to a substitution Sj on the 60 roots 

2 |= LiiZi). 

We shall prove that the set of these 60 substitutions Si has the 
characteristic properties A and B of §89 of the Galois group of 
J( 2 ) =0 for F. Let/(si,..., 26o) be any rational function of the 
Zi with coefficients in F. When each 2 < is replaced by Li{zi), let 
/become ^( 21 ), whose coefficients belong to F. 

First, let / be unaltered in value by each of the substitutions S,-. 
Then <^(si),..., tl>(zea) are all equal. Their sum, which is equal 
to 60/, is a symmetric function, with coefficients in F, of the roots 
of I ( 2 ) = 0. Hence / is equal to a quantity in F. 

Second, let / be equal to a quantity q in F. Then <j>(z) = g is 
an equation with coefficients in F which is satisfied by one root Zi 
of the equation I ( 2 ) = 0, irreducible in F, and hence (§80) by 
all its roots 2 ,-. In other words, the <^( 2 ,) are all equal, so that/ 
is unaltered in value by all the substitutions Si. 

Theobbm 16. The Galois group G of the icosahedral equation 
(39) for the field R{Z, e) is simply isomorphic to the group Fm of 
the linear fractional icosahedral transformations. 

Hence, by Theorem 12, G is simply isomorphic to the alternating 
group on five letters. The icosahedral equation is therefore not 
solvable by radicals (§111). 

Theobeh 17. The solution of either the icosdhedrcd equation or its 
resolvent qvintic equation (47) is equivalent to the solution of the 
other. 
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For, if we have found a root z of (39), we may compute at once 
the rational functions Wk ^ tkPIT of z and hence have the roots 
of (47). Conversely, if we have found the roots Wk of (47), we 
may compute z and hence all the roots of (39) as rational functions 
of the Wk^ Z, €. For by Theorem 12, the identity is the only trans¬ 
formation of Feo which leaves unaltered each tk and hence each 
Wk, whence the identity is the only substitution of G which leaves 
each Wh unaltered. Hence the adjunction of aU the to the field 
F reduces Q to the identity group, so that each root of (39) is in 
the enlarged field. 


Exercises 

1. The solution of the principal resolvent of the icosahedral equation is 
equivalent to the solution of the latter. Then by Theorem 17, the roots of the 
principal resolvent are rational functions of Z and the roots of resolvent (47), 
and conversely. This would have enabled us to predict the existence of a 
Tschirnhaus transformation which replaces (47) by the principal resolvent 
( §135). 

2 . The icosahedral equation is its own Galois resolvent. 

3 . For a field containing Z\ and (—3)^, the tetrahedral equation (32) is 
its own Galois resolvent, and its Galois group is simply isomorphic to Fia 
and hence to the alternating group on four letters. Thus (32) is solvable by 
radicals. 

137. Further results. The solution of the general quintic may 
be reduced to the form problem of a group of 60 linear transforma¬ 
tions on three variables.^ The general sextic has been reduced to 
the form problem of Valentiner^s group of 360 ternary linear trans¬ 
formations which is simply isomorphic to the alternating group on 
six letters.^ The general equation of degree 7 reduces to the form 
problem of a linear group of order 171 on 4 variables.® 


^ Klein, Ikosaeder, 1884,211-60. 

“Klein, Math. Annalen, 61, 1905, 50-76 (reprinted from Jour, fur Math., 129, 
1905, 151-74); Math. Abhandl., II, 1922, 481-502. 

“Klein. Math. Annalen, 28, 1887, 499-532; Math. Abhandl, II, 1922, 439-72. 
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The symmetric group on n letters can be represented as a linear 
homogeneous group on variables xi,... ,Xn subject to the relation 
= 0(cf- §14:0). But^ for n> 7, neither the symmetric 
nor the alternating group on n letters is simply isomorphic to a 
linear group on n — 2 or fewer variables. In other words the 
general equation of degree n ^ 8 can be reduced to a form prob¬ 
lem of order n — 1, but not lower. For a clear summary of these 
and related topics, see Wiman’s article® in EncyklapSdie Math. 
TFiss., 1,1,1904, pp. 522-54. 

^Wiman, Math. Annalen, 52, 1899, 243-70. 

®Later literature: E. H. Moore, Math. Annalen, 51, 1898-9, 417-44. Coble, 
ibid., 70. 1911, 337-50; Trans. Amer. Math. Soc., 9, 1908, 396-424; 12, 1911, 311- 
26, Dickson, ibid., 9, 1908, 121-48; 12, 1911, 75-97. Speiaer, Matii. Annalen, 77, 
1916, 546-62. Schur, Berlin. Berichte, 1908, 664-78; Jour, fur Math., 127, 1904, 
20; 132, 1907, 85; 139, 1911, 155-64 (of. de S6guier, Jour, de Math., (6), 6, 1910, 
387-436; 7. 1911. 113-21). Weber, Algebra, ed. 2, 1899, II, 228-301, 373-89. 
470-550. 
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REPBESENTATIONS OF A FINITE GROUP AS A 
LINEAR GROUP; GROUP CHARACTERS 

138. Introdttctioii. As a sequel to the preceding chapter, we 
shall now prove some remarkable theorems on the representations 
of a given group as a linear group, and also give an introduction to 
the theory of group characters. The latter theory is an effective 
tool for ^te groups and has led to theorems not proved other¬ 
wise, sudi as the fact that every group of order p“ is solvable 
if p and q are primes, and that every transitive group of prime 
degree is either metacyclic or doubly transitive. 

Starting with a simple example due to Dedekind, B^obenius 
developed the theory in a series of complicated memoirs.* We 
shall follow the simpler exposition given by Schur® partly because 
it is unusually attractive and partly since alternative introduc¬ 
tions* to the subject are already available in English.^ 

139. Reducible linear groups. Let a finite number g of n-rowed 
non-singnlfl,r matrices R form a group such that there exists a 
non-singular r^-rowed matrix P for which 


1 Berlin. Bericbte, 1896-1903. The author gave au elementary escposition of 
[Frobenius's theory in Annals of Math., 4,1902, 25-49; also an extension to groups 
of transformations modulo Trans. Amer. Math. Soo., 8, 1907, 389-98; Bull. 
Amer. Math. Soc., 13, 1906-7, 477-88. 

* Berlin. Beriohte, 1906, 406-32. Hsrpercomplex numbers were employed in the 
treatment by MoUen, Sitzungsber. Naturf. Gesell. Dorpat, 11, 1896, 269-88. 

*Bliohfeldt, Finite CoUineation Groups, University of Chicago Press, 1917, 
116-38. Miller, Blichfeldt, and Dickson, Finite Groups, 267-78. Burnside, 
Theory of Groups, ed. 2, 1911, 243-371, 464-84, 499. 

^The author added §146 and the examples. 
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for all g matrices R. Here Ar and Dr are square matrices with a 
and d rows respectively, Ur has- d rows and a columns, while all 
elements of matrix 0 are zero. Then the group is called reducible. 

Thboeem 1. The matrices of a reducible group can be trans¬ 
formed sirnvUarmusly intd'- 




With R and S, the given group contains RS. Thus 
Mrs = P-i RSP = P-i RP ■ P-i SP = ATr Ms, 


whence 


CD 

•^RS ““ -Dr A 

(2) 

^RS “ ^^R *^S “1“ ^R 


Multiply the members of the last equation on the right by 
Ag-i and sum for the g values of S. Since Ag Ag-i = Ai„ is the 
a-rowed identity matrix, we get 

E Crs Ag-I = gCR + Dr - gP, P = i 2: Ug Ag-.. 

s g 6 

In the first sum replace S by P-* S. Using (1), we get 

ECsAs-iAr = ^PAr. 

s 

Suppressing the common factor g, we get 


^ MBschke, Math. Aonalen, 62, 1899, 363. 
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(3) FA^ = C^ + D^F. 

Consider the matrix of determinant unity 



By (3), Mr Q = QN^i where Nr is ^ven in the theorem. Hence 
PQ is non-singular and transforms the given group into the group 
of matrices Nr. 


140. Representations of a group as a linear group, group matrix. 
Consider a group of substitutions on n letters. Let any two of its 
substitutions be 


Kill tjt rtJ’ \ti fs 



To R make correspond the linear transformation 


r * Vi-i • * • > Vn* 

To S corresponds s: vi = h, • ■ ■ > Vn = tn- Hence to BS corre¬ 
sponds rs: = fi,..., in = f», if we define the product rs as 
in §22. 

Thbobbm 2. Any svisUtuUon group on n letters is simply iso¬ 
morphic to a linear group on n variables. 

A substitution group may be simply or multiply isomorphic to 
various linear groups. 

Let Si,..., Sj be the substitutions of a group G (or elements 
of an abstract group 6 ), whose identity is Si. For i = I, • •. i 
let M,i be an /-rowed square matrix such that 
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(4) M,i Mif — M,f,j {h j — 1> • • • > 

and sucy that not every M,i is singular. By (4) for j = 1, we 
have — 7) = 0, whence = I, the /-rowed identity 

matrix. Since M,i = JIfq = I, every is non-singular. 

Then the set of linear transformations having the matrices 
Af,i,..., Jlf»j is called a representation of (? as a linear group. 
When the M’s are not all distinct, the linear group contains 
duplicate transformations and is multiply isomorphic to O. 

To enable us to treat all the M’s rimidtaneously, we introduce 
g independent variables = 1,..., j) and call 

(5) -x: = S 

i-i 


the grmip matrix corresponding to the representation of O as the 
linear group {M«i,..., M.^}. 

For example, let si = 7, S 2 = (12) (34), Ss = (13) (24), 
Si = (14) (23). A group matrix is 


X = 


Xi X2 Xi 

X2 Xi Xa Xz 
Xb xa Xi X2 
Xa Xs X2 Xi 


Let T denote the transformation, of matrix X, which expresses 
, ^4 linearly in terms of 9 ?i,..., 


variables 

?i = + & + fa + 

?3 = £[ — fe 4* ^4, 

= 171 + + 1?3 + V4, 

= 1/1 — 172 + 173 — 174, 


Introduce the new 

ft = + & — & “ 

ft = 6 l “ ft ft + ft, 

6)2 = 171 + 772 — 778 — 774, 

6)4 = 171 — 772 — 773 + 774. 


^This assumptiozi, not made by Scbnr, simplifies some proofs. 
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?i = (ail + % + »3 + 

^2 = Xi — Xa — Xi)c02, 

fs = {Xi- Xi + Xa - Xi)<aa, 
f4 = {xi — Xt — Xa + Xi)wi. 

141. Irreducible group matrices. If X is a group matrix and 
P is a constant matrix whose determinant is not zero, then 
p-i XP is a group matrix called equivalmd to X. 

Lemma. If X and Y are two irredudbh group matrices of orders 
f and h for which XP = PY, where P is a constant matrix with f 
rows and h columns, then either P = 0, or dse f - h and the deter¬ 
minant of P is not zero, so that X and Y are equivalent. 

Suppose that P 0. Then P is of rank r > 0. Write s = / — r, 
t — h — r. By §30, there exist non-singular matrices A and B 
with / and h rows, respectively, such that 


«- - (i;. 


Zrl\ 

zj' 


where It is the Mowed identity matrix, while 2,«is the zero matrix 
with r rows and t columns. Write 


AXA- - z. - g;; f 

where Xrt has r rows and t columns, etc. We have 


Xi Q = AXA-^ Q = AXPB = APYB = APBYi = QYi. 
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Hence 


(X„ 

Zrt\ 

AV (Y rr 

Yri\ 

vx.. 

zj 

- - u. 

z.J’ 


whence X,r = 0, Yrt = 0. If either s > 0 or < > 0, X or F would 
be reducible, contrary to hypothesis. Hence s = t = 0,r=f = h, 
so that Q = If and P = A~^ QB~^ is a square matrix whose 
determinant is not zero. 

CoROLLAEY. Every constant matrix M commutative with an 
irreducible group matrix X of order f is of the form clf. 

For, if c is a root of \M — z If \ = 0, P = M — clf is a con¬ 
stant matrix whose determinant is zero such that XP — PX, 
whence P ~ 0. 

If P is any one of the elements Si, of a group Q, the 

corresponding /-rowed matrix Aa (previously denoted by M^) of 
a representation of G as a linear group will be designated by 
(aijs.). In a second representation of G, let R correspond to the 
fe-rowed matrix These notations are employed in 

the proof of the following 

Theorem 3. Consider any irreducible group matrix 



I! 


of order f. The coefficients of 



Xii — ^ CtijlLXTi 

{B = si,.. 

• > 

satisfy the relations 



Q 

(6) = -zdii djk 

{if ji kf 1 == 

1 ,...,/), 

where da = 1, dij- = 0(i If 



Y - (2/p.) 

II 

.,h) 
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is a second imducihle group matrix of order h belonging to the same 
group Of which is not equivalent to X, and if 

. VpQ. “ iE!) Si) • .., Sff), 

then 


(7) = o (ii-\ 

For any two elements R and S of 0, we have 



-4-r-^b -^Rs> -Sr^s -Srs* 

Let U — (uif) be an /-rowed square matrix whose elements 
Uij are arbitrary constants, and write 

( 8 ) V^EA^-tUA^. 

R—ai 

Then 

i4g-l VAq — R""^ ^-4.R8* 

B 

The second member is equal to V since RS ranges over the ele¬ 
ments of the group G when R ranges over them. Since Aq-i A^ 
^ Aj^ = Iff we have VAq — Aq V. Hence 7 is commutative 
with every Aq and hence with X. By the Corollary, 7 = y //. 
By (8), t; is a linear homogeneous function of the u’s: 

V ^^CihUik^ 
h 


R j, k 


Hence by (8), 
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for every Z, and for all values of w,**. Hence 

(9) O/kl'B. ^ Cjk (fj 3) ^ ^ 1, . . • yf')» 

a 

Take I = i and sum for i == 1,..., /; we get 

R » 

But Ar Ar -1 = Aj =* 1/ implies 

X) ^JfetR = dkj-- 

t 

Summing tins for iJ == si,..., Sg, we obtain the left member of 
the preceding equation. Hence fcji = gdkj- Inserting into (9) 
the resulting value of Cjk, we get (6). 

Let W = (wip) be a matrix with / rows and h columns whose 
elements are arbitrary constants, and write 

2= ZAk-iFHb. 

R 

As before, Ag-i ZB^ = Z, whence ZY = XZ, Since X and Y are 
not equivalent, the Lemma shows that 2 = 0. Hence 

^pflR = 0 ~ j/; ; h). 

R i.P 

Since the Wjp are arbitrary, we have (7). 

Theobem 4. If X — {xij)f Y = (y^g), are any irreducnhle 
group matrices of orders /, A,... belonging to the same group 
G = {si,..., s^}, such that no two of the matrices are equivalent, 
then the P • • • homogeneous linear functions Xi^, . o/ 

the variables x^i are linearly indep&ndent. 
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For, if the c, dy.,. are constants such that 


Cij Xij -j- dpq y^q — Oj 

*• 3 Pi Q 

then in the notations of Theorem 3, 

Ctj a</R “f“ ^ V ^PQ ~ 0- 

ii i Vi Q 

Multiply by and sum for JB = si,... , Sg, By (6) and (7), 
we get 

2 C»3 ^3fc “ 0. 

*. 3 

This sum evidently reduces to Cu- A similar proof gives dpq = 0, 

Thboeem 5. The determinant of an irredwible group matrix ia 
an irreducible function of the variables Two irreducible group 
matrices are equivalent if and only if their determinants are identicaL 

For, if Z = (xii) is an irreducible group matrix of order/, the 
Xij are P linearly independent functions of the ojn (Theorem 4). 
Hence if the Ua denote/* independent variables, thejf^ equations 
Xii = Uii determine p of the Xr as homogeneous linear functions 
of the Uij and the remaining ( §32), If the determinant ] xa | 
were a product of two polynomials in the then after assigning 
the value zero to each of those g P remaining rcR, we obtain a 
decomposition of d = | Uij | into two polynomials v and w in the 
Uij. Since d is of degree 1 in each variable w, we may assume that 
t; is of degree 0 and w of degree 1 in wu* No term of the expansion 
of the determinant d contains the product of Un by an element 
Uri of the first column. Hence v is of degree 0 in Uri, whence w is 
of degree 1 in it. Since Uro v^n is not a term of v = d, v is of de¬ 
gree 0 in every Utg* Hence a determinant whose p elements are 
independent variables is irreducible. This completes the proof of 
the first part of Theorem 5. 
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If Z — (Xii) and Y » are irreducible group matrices 
which, are not equivalent, Theorem 4 shows that we can choose 
the so that the Xij and the take any assigned values. Hence 
the determinants of X and Y are not identical. 

142. Reducible group matrices. By repeated applications of 
Theorem 1, we see that a reducible group matrix X is equivalent 
to a group matrix Y in 


( 10 ) r = 

Yi 0 

0 Ys 

0 0 

0 0 

1 

1 

, 2 = 

Zi 0 • 
0 Zi • 

•0 0 
•0 0 


0 0 

•••0 Y« 

i 

0 0 • 

•0 Zn 


where each F»- is an irreducible group matrix. 


Theorem 6. Every group matrix is equivalent to one of type Y 
in (10). 

Theorem^ 7, If a group matrixX is equivalent to both Y and Z 
in (10), where also each Ziis irreduciblej then m — n and Fi, . . ., 
Fw are equivalent in some order Zi, ..., 

For, let di and D* be the determinants of Fi and Zi, respectively. 
By Theorem 5, di and Di are irreducible. Since 

di da dm ^ D1D2 • • * Dn, 

and since a polynomial is decomposable into irreducible polyno¬ 
mials in a single way, apart from the arrangement of the factors, 
we have m = n and see that di,..., are the products of 
Di, . .., Dm in some order by constants. But for x^ = 1 , Xq^ 
= 0(22 5 *^ 7), the determinant of a group matrix reduces to unity. 
Hence the constant factors are all unity. By the last part of 
Theorem 5, matrices Fi,..., Ym are equivalent in some order to 

Z13 , .., Zm* 

iRor a more general such theorem, see Loewy, Trans. Amer. Math. Soo., 4, 1903, 
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In view of Theorem 7, we shall call the group matrices Yi,..., 
Fm the irreducible components of the group matrix X. If two of 
tbftTn are equivalent they may be taken to be equal. Then if 7, 
occurs exactly fj times, we call rj the index of 7j, and the set of 
r’s the indices of 7 or X, 

Theorem 8. If X = («<#) is a group matrix vM ike indices 
Ti,... ,Ti and if ft,... ,fi are the orders of its irreducible compo¬ 
nents, then exacUyfi^ -I- hfi^of the functions xa of the variables 

Xit are linearly independent. Also n® -h r?® is equal to the 

number v of linearly independmt constant mairices P whidi are 
commutative with X. Finally, I is equal to the number w of linearly 
independent constant matrices which are commutaHoe with X and 
mih every P. 

For, let 7i,..., 7m be the irreducible components of X, so 
that X is equivalent to 7 in (10). The first part of our theorem 
follows from Theorem 4. To prove the last two parts, we may as¬ 
sume that the 7 i are chosen so that the first n are equal, the 
following r^ are equal, etc. Then let 7 i be of order ti. 

The numbers v and w remain unchanged if we replace X by 7. 
Let P be a constant matrix commutative with 7. We may write 


Pu 

Pi* 

* • ‘ Plm 

P ml 

PmSt 

• • • P fnm 


where the matrix Pa has ti rows and columns. Then 7P = P7 
if and only if 


7i Pii = Pii Yi {i, 3 = I,...,m). . 

If Yi and 7/ are not equal and hence not equivalent, then 
Pii = 0 by the Lemma. If they are equal, the corollary in §141 
shows that Pa = e Fi, where Fi=> la is the identity matrix of 
order U = tj. Hence 
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CuPi •• 

‘ CiriPl 

0.0. 


Cril • * 

* Criri -^1 

0 . 0 

p = 

0 •• 

• 0 

dll F ,^+1 • • • F 


Q.... 

... 0 

dr2l F fj-Hl ' * * dr2r2 F ri-|-l 


Conversely, its origin shows that every such matrix P is com¬ 
mutative with X. The number of its arbitrary parameters Cjy, 
dhn ,... is 0 = ri* + r 2 ®‘ +-h ri\ 

finally, let Q be a constant matrix commutative with X and 
with every P. Since Q is commutative with X, it may be derived 
from P by replacing the c, d,... by C, D,.... Since Q shall be 
commutative with every P, matrix (Ca) must be commutative 
with every matrix (c<,), and (Dim) with every (dim), etc. Hence 
(Cij), (Dim ),... are products of identity matrices of orders 
ri, r*,... by constants. These are also suflSleient conditions that 
Q be commutative with X and with every P. The number w of 
arbitrary parameters in Q is evidently I 

143. Regular group matrix. With any group (? = {si,..., s,} 
is associated a p-rowed square matrix X whose element in the «th 
row and jth column is x with the subscript s< Sj-K Hence X 
= ^M, x„ summed for all the elements s of G, where M, denotes 
the matrix whose element o-,-,- in the ith row and jth. colunm is 1 
or 0 according as S{ s,“^ is or is not equal to s. Similarly, ilf * is the 
matrix whose element t/j in the yth row and Atii column is 1 or 0 
according as SjSi,-^ is or is not equal to t. In the product M, M t, 
the element in the ith row and i:th column is ira = o’ii'rjk. 
Unless the product o-t is zero, it is 1 and 

Si = S, Sj sj-i = t, 

whence Sk~^ = st. Hence if the latter equation does not hold, 
then vib — 0. If it holds, the single term of irtk having a value 
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0 is that for which j is determined by Sj = tsu, and then 
Tik = 1. Hence M,Mt = M,t, 

If Si is the identity of G, evidently is the gf-rowed identity 
matrix. This completes the proof of 

Thbobbm 9. X ^ a group matrix corresponding to 0. 

It is called the regidar group matrix. When G is a regular sub¬ 
stitution group ( §90), the method given at the beginning of §140 
to represent G as a linear group leads to our present matrices M,f. 

Thbobem 10. Arrange the irreducMe components Xi of the 
regular group matrix X so Owt no two of Xi,... ,Xc are equivalent, 
while every further component is equivalent to one of &iese c com¬ 
ponents. If Xi is of order fi and, index e< (so fhai Xi is one of exacAy 
Oi equivalent components of X), then ft = eifor every i, and c is the 
number of classes of confugate demenis of the group G of order g. 

Counting the number of rows in two ways, we have 
Oifi + • • • H- Cc/« = ff* 

Since each element of X is one of the g variables x, the first part of 
Theorem 8 gives 

/i»-I- • 

Let F be a matrix independent of the x’s. Let [si, s^] denote the 
element in the tth row and jth column of Y. By considering the 
element in the ith row and A;th column of XY and that of YX, 
we see that X and F are commutative if and only if 

2 [Sft ^ (i, ft = 1 ,..., g), 

3 t 

identically in the x^s. The two indicated have equal subscripts 
i£ St — Si sr^ sjfc. By their coefficients, 
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Define by Si = s„ s/. Then 

[®>’j Sfc] “ Sn®*] (J) k, n.— 1, . .., g). 

If 8i is the identity of G, write y,for [Si, Si]. Then 
= [sr^Si, si] = [So s,], Y = 

Since this matrix Y involves g independent parameters y, the 
second part of Theorem 8 gives 

ei® + • • • + Ce® = g. 

Hence 

(ei-/i)®+ ••■ + (e«-/c)® = g-2(7 + g = 0, 
so that ei=‘Ji for every i. 

A matrix Z independent of the x’a which is commutative with 
X is of type Y. Since Y differs from X only in the,order of mul¬ 
tiplication of subscripts, a matrix commutative with every Y 
is of type X. Hence Z is commutative with X and with every Y 
if and only if its element in the ith row and jth column is simidta- 
neously of the forms 

(i, j = l,...,g). 

Define s by s< = s,- s. Thus «, = z* if and only if < = s,- s and 
hence if s and t axe conjugates in G. By the final part of Theorem 
8 , the number c of independent parameters in Z is the number of 
classes of conjugate elements of G. 

If possible, let there be an irreducible group matrix ff of order A 
corresponding to G which is not equivalent to one of the c irre¬ 
ducible components Xi,... ,Xc of the regular group matrix. 
The first part of Theorem 8 shows that in the group matrix (Xi,j) 
having the irredudble components Xi,..., Xe, H, 


/i®+ ••■+/c® + A® = y + A® 
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ofthefunotionsa:<,-of a!,j, ..., a:, linearly independent, whereas 
evidently not more than g functions of them are linearly independ¬ 
ent. This proves 

Thboeem 11 . Corresponding to a group 0, ihe total number of 
irreducible group matrices no two of which are equivalent is equal to 
(he number c of classes of conjugate elements of 0. 

144. Group characters. Consider a representation of G — {si 
= I, Ss,..., Sj} as a group of linear transformations with the 
non-singular matrices Jf.,. Denote the trace (sum of the diagonal 
elements) of the latter by x (s<)- The set of g numbers % (si),..., 
X («») is called a character of G corresponding to this representation 
or to the group matrix X x,. belonging to it. 

For the 4rrowed group matrix X in the example in §140, M,i 
is the identity matrix, whence x(si) = 4; while M,^ is obtained 
from X by taking % = 1, aSi = 0(t 2), whence xM = 0. We 

saw that fa = 2 0 ) 2 , where z = Xl + Xz ~ Xt — xt. If we employ 
this new group matrix (s) having a single element 2 , we obtain the 
new character x(si) = xCss) = 1, xM = xM = - 1. 

Since equivalent matrices have the same trace, to equivalent 
representations or equivalent group matrices correspond the same 
character. By Theorem 11, there exist irreducible group matrices 
Xi ,..., Xo, no two of which are equivalent, such that every 
irreducible group matrix is equivalent to one of them. The 
corresponding (simple) charactersare denoted by xi(si),. •. ,Xe(s{)- 

If among the irreducible components of the group matrix X, 
Xj occurs exactly tj times (r# 2 0), evidently 

(11) x(s<) = n Xi(s<) +-h n X«(s<). 

Let u and 0 be two distinct numbers of the set 1 ,... c, and write 
s • ■ 

f ^ 1—1 ^ f ) 
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Xu (R) — CtHRj Xv (JR) — 23 

i p 

In (6) replace i by m and k by n, multiply the resulting equation 
by aims ajfcnT and sum for m, ti = 1,. .., We get 

f f g 

E23 GiimS '23 ^knT ClnlR “ ^23 ^ in ^»otS OthnT* 

R TO—1 n—1 J m, n 

The summand on the right is zero unless m I, n — j. Since 
As jIr-i = Asr- 1 , we may perform the summations on the left and 
obtain 

(12) ^2 dklTIt = ? fltijs dkjT^ 

R J 

From (7) we obtain in the same manner 

(13) 23 ^i/SR-l &pflTR — 0. 


In (12) take j = i, I = k, T = I; we get 

E q 

ttttSR-l flfc&R = “ttijbs djeu 
R J 

Sum for i and ft, and mark f with the subscript u. Thus 
(14) E (SR-^)xu (R) = ^J Xu iS). 

R Ju 


Since Xu(I) =® /», the case S = I m 


= flf. 
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Similarly, in (13) take j = i, q — p, T >= I, and sum for i, p. 
We get 

(16) 52 Xu (.SB~^)xv (12) = 0 (ut^ v). 

R 

We return to the general group matrix X to which corresponds 
the character. x(s<) for i = 1,..., g. Multiply (11) by x*(si"^) 
and sum for i = 1,..., g. We get 

0 eg 

22x(si)xu(si“‘) =2 S'"® x®(s<)x»(sr0- 

,<-X «-l i-l 

For V 7 ^ u, the inner sum on the right is zero by (16) with S = I. 
For » = w, it is »•„ g by (16). Hence r« is determined by the values 
of x(s<) s-od the.fixed numbers x«(srO- Bot r,- is the number 
of times occurs as an irreducible component of X. This proves 

Theorem 12. Tvm representations of G as linear groups are 
equivalent if and mdy if their corresponding characters are the same. 

145. Applications to group matrices. Let G be a conunutative 
group, and X = 22 be a corresponding irreducible group 

matrix of order/. Since M is commutative with X, it is* of the form 
Ci If by the corollary in §141. If Si is of order e*, M‘* <= I/, whence 
Ci is an e<th root of unity. If / > 1, X = 22 would be a 

reducible matrix. 

Theorem 13. An irreducible group matrix corresponding to a 
commutative group is of order unity. 

See the example at the end of §140. 

It follows from Theorem 6 that every representation of a com¬ 
mutative group of order g is equivalent to one whose matrices 
have gth roots of unity in the diagonal and zeros everywhere out¬ 
side the diagonal. This property therefore holds for the special 
case of a cyclic group and proves 
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Theobbm 14. The trace of any linear tran^ormation of finite 
order is a sum of roots of unity. 

Thbokbm 15. The order of every irreducible group matrix of G 
is a divisor of the order g of Q. 

Write S, = 1 or 0 according as s = I or s 5^ J. Then 

E = x(S), 

B 


SO that (14) may be written in the form 

i: - x(Sii:-o}= 0. 

Since x(-f) = / 0, the determinant 


- x(-SE-0 


= 0 


[S,R = si,..., Sg). 


We saw that the trace x of SB~^ is a sum of roots of unity and 
hence is an integral algebraic number. Thus g/f is a root of an 
equation of degree g whose leading coejficient is unity and remain¬ 
ing coefficients are integral algebraic numbers. Hence^it is an 
integral algebraic number. Being at the same time a rational 
number, it is a rational integer. 


146. The alternating group G on five letters. By Ex. 6, §111, 0 
has exactly five sets of conjugate substitutions. Let the five repre¬ 
sentations of (? as irreducible linear groups have the orders 1, a, b, 
c, d. They are divisors of 60 and the sum of their squares is 60. 
Their maximum is evidently 6. If d = 6, a® -f 6* + c® =23, 
which is impossible in integers. Let d = 5. Then a* + 6* -h c® 
= 34. If c = 6, o® + 6® = 9 and a or 5 is zero. If c = 4, o = 5 
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= 3. Not all of a, b, c are g 3. Filially, if a, 6 , c, d are all g 4 not 
all are 4, while 1 + 4* + 4* + 4* + 3® = 58 < 60. ’ 

Thbobem 16. Every representoHon of the dUemating group on 
five letters as an irreducible linear group is eguivdlewt to one on 1 3 
4, or 5 variables, there being two types on 3 variables. ' ’ 

147. Computation of group characters. In (12) take ft = j, 
I = i, and sum for i and j; we get ’ 

(17) /E xiSR-^ TB) = gxiS)x(T). 

R 

Since the traces of Ag and Ag Ar pe equal, 

(18) xm) = xiSR). 

Consider the sjmimetric group on 3 letters and write 

E = identity, A = (132), S = (123), C - (12), D « ( 13 )^ 

F = (23). 

The three sets of conjugates are E; A, B; C, D, F. By (ig), 

x(A) = x(B), x(C) = x(J>) = x{F). 

By (17) for T ^ E, x(E) ^ f. By (17) for S ^ T = A: 
S — T = C; S = A, T — Cia turn, and by (15), we get 


3/* + 3/ x(A) = 6xHA), ^ + 4f x.(A) = 6x»((?), 
6 /x(C) = 6x(A)x(C'), f + 2x*(A) + 3x*(C) = 6 . 
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If x(C') ^ Oj relation gives x(A) — Si l^he other 

relations give xHC') =/* = !• Since the order/of the irreducible 
group matrix is positive, we have 

/=!, x(^) = x(^) = l, x(C) = ±l. 

If %(<?)= 0» tide conditions reduce to x(-^) = ~ 4/i ® 

whence 

/ = 2, xVS) = 2, x(A) = - 1, x(.C) = 0. 
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